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METHOD FOR PURIFYING ICgRATINOCYTE GROWTH FACTORS 

Field of the Invention 

5 The present invention relates to the field of 

protein purification. Specifically, the present 
invention relates to the field of purifying Jceratinocyte 
growth factors . 

io BacfrqrQvuifl <?f the Invention 

Polypeptide growth factors are important 
mediators of intercellular communication (Rubin et al. 
(1989), Proc. Natl. Acad. Sci. USA, ££:802-806). These; 

15 molecules are generally released by one cell type and 
act to influence proliferation of other cell types . 

One family of growth factors is the fibroblast 
growth factors (FGF) . There are currently eight known 
FGF family members which share a relatedness among 

20 primary structures: basic fibroblast growth factor, 
bFGF (Abraham et al. (1986), EMBO J. . 5:2523-2528); 
acidic fibroblast growth factor, aFGF (Jaye et al. 
(1986), Science, 211:541-545); int-2 gene product, int-2 
(Dickson & Peters (1987) , Nature. I2£:833); hst/kFGF 

25 (Delli-Bovi et al. (1987), Cell, 5£:729-737, and 
Yoshida et al. (1987), Proc. Natl. Acad. Sci. USA, 
M:7305-7309) ; FGF-5 (2han et al. (1988) , Mol. Cell. 
Biol., 1:3487-3495); FGF-6 (Maries et al. (1989), 
Oncogene, A:335-340) ; keratinocyte growth factor (Finch 

30 et al. (1989), Science, 24:752-755; Rubin et al. (1989), 
Proc. AMI. .Acad. Sci. USA, S£:802-806; Ron et al. 
(1993), The Journal of Biological Chemistry, 
2£fil41:2984-2988; and Yan et al. (1991), In Vitro Cell. 
Dev. Biol., 27&: 437-438) ; and hisactophilin (Habazzettl 

35 et al. (1992), Mature, 152:855-858). 
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Among the FGF family of proteins , keratinocyte 
growth factor (KGF) is a unique effector oif non- 
f ibroblast epithelial (particularly keratinocyte) cell 
proliferation derived from mesenchymal tissues. The 
S term "native KGF* refers to a natural human (hKGF) or 
recombinant (rKGF) polypeptide (with or without a signal 
. sequence) as depicted by the amino acid sequence 
presented in SEQ ID NO: 2 or an allelic variant thereof. 
[Unless otherwise indicated, amino acid numbering for 

10 molecules described herein shall correspond to that 
presented for the mature form of the native molecule 
(i.e., minus the signal sequence) , as depicted by amino 
acids 32 to 194 of SEQ ID NO:2.] 

Native KGF may be isolated from natural 

15 sources. For example , hKGF can be isolated from medium 
conditioned by an embryonic lung fibroblast cell line - 
(Rubin et al. (1989), supra. Three chromatographic 
steps r namely heparin- Sepharose 1 * (Pharmacia, Piscataway, 
NJ) af finity chromatography, HPLC gel filtration, and 

20 reverse-phase HPLC, were used to obtain a purified hKGF 
preparation. Approximately 6 mg of hKGF were recovered 
from 10 liters of conditioned medium. . These 
chromatographic steps only recovered 0.8% total hKGF 
based upon a mitogenic activity assay. A further 

25 example teaches the use of another chromatographic step 
using heparin- Sepharose 1 " affinity and Mono-S™ ion- 
exchange chromatography^ (Pharmacia, Piscataway, NJ) for 
isolation of rKGF produced in bacteria (Ron et al. 
(1993), Journal of Biological Chemistry, 251:2984-2988). 

30 The properties of keratinocyte growth factors 

suggest a potential for the application thereof as a 
drug for promoting specific stimulation of epithelial 
cell growth. It therefore would be desirable to develop 
a method or methods for obtaining relatively high levels 

35 of homogeneous keratinocyte growth factors to provide 
sufficient quantities of material for comprehensive in 



WO 96/11952 FCT/US95/13099 

- 3 - 

vitro and in vivo biological evaluation and for a 
potential therapeutic application. 

It is the object of this invention to provide 
a novel method for the purification of keratinocyte 
5 growth factors. 

Summary of the- Invention 

The present invention is directed to a first 
10 method for purifying a keratinocyte growth factor (KGF) , 
the method comprising: 

a) obtaining a solution containing KGF; 

b) binding KGF from the solution of part (a) 
to a cation exchange resin; 

15 c) eluting KGF in an eluate solution from 

the cation exchange resin; 
d) passing the eluate solution from part (c) 
through a molecular weight exclusion 
matrix; and 

20 e) recovering KGF from the molecular weight 

exclusion matrix. 

The^ invention is further directed to a second 
method for purifying a keratinocyte growth factor (KGF) , ~ 
25 . the method comprising: 

a) obtaining a solution containing KGF; 

b) binding KGF from the solution of part (a) 
to a cation exchange resin; 

c) eluting KGF in an eluate solution from 
30 the cation exchange resin; 

d) performing hydrophobic interaction 
chromatography on the eluate solution of 
part (c) ; and 

e) recovering KGF from the hydrophobic 

35 interaction chromatography step of part 

(d). 
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Generally , the cation exchange chromatography 
step of the first or second methods may be conducted 
with any suitable buffer (e.g., phosphate buffer saline, 
sodium acetate or tris-HCL) at a pK of preferably 
5 between about 6.8-7.5. Suitable columns for use in thi s 
step include carboxymethyl cellulose, carboxymethyl 
agarose and sulfated agarose and cellulose columns 
(e.g. , columns of S-Sepharose Fast Flow"* resin, Mono-S 1 * 
resin and CM-cellulose 1 " resin, commercially available 

10 from Pharmacia, Piscataway, NJ) . The flow rate will be 
variable depending upon" the column size. 

The gel filtration step of the first method may be 
conducted in any suitable buffer (e.g., phosphate buffer 
saline) at a pH of preferably between about 7.0 and 7i5. 

15 Suitable columns for use in this step include agarose- 
based, acrylamide-based, silica-based or polymer-based ' 
size-exclusion columns (e.g.* columns of Sephadex G-75** 
resin and Superdex-75 ,w resin, commercially available 
from Pharmacia). 

20 In a particularly preferred embodiment of the 

second method, free sulfhydryl groups may be oxidized 
prior to the hydrophobic interaction step, discussed 
below. -Any manner of oxidation may be employed. For 
exaiqple, the protein may be exposed to atmospheric 

25 oxygen for a suitable pcfiriod* of , time. Alternatively, 
various oxidation procedures may be. employed. One such 
procedure is particularly suited for keratinocyte growth 
factors wherein one or more cysteine residues, as 
compared to the native KGF molecule, are deleted or 

30 replaced. In this procedure an oxidizing agent (e.g., 
cyst amine dihydrochloride or another appropriate 
oxidizing agent, for instance, cystine, oxidized 
glutathione or divalent copper) may be added to a final 
concentration, adjusting the pH to preferably between 

35 about 7-9.5, with pH 9.0 ± 0.3 # C being more preferred 
when using cyst amine dihydrochloride) , and holding the 
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temperature at preferably between about 10-30 # C, for an 
appropriate period. The second procedure may be used 
for oxidizing native KGF and other keratinocyte growth 
factors with comparable patterns of cysteine residues. 
5 In this procedure, oxidation may be accomplished by 
adding an appropriate amount of an ionic strength 
modifier (e.g., (NH^SO^), adjusting the pH to 
preferably between about 7.5-9.5, and holding the 
temperature at preferably between about 23 ± 5 # C for an 

10 appropriate period. 

The hydrophobic interaction step of the second 
method may be conducted by using any suitable buffer 
(e.g. , sodium phosphate) at a pH of preferably between 
about 6.0-8.0, more. preferably about 7.0, and by eluting 

15 • with a decreasing linear (NH4)2S04 gradient ranging from 
2-0 M. Suitable - columns for use in this step include 
alkyl or phenyl substituted resins ( e.g.., a column of 
Butyl-650M Toyopearl** resin, commercially available from 
Tosohaas, Inc., Montgomeryville, PA and columns of 

20 phenyl Sepharose 1 " resin and phenyl Superose™ resin, 
commercially available from Pharmacia) ; 

The process of the present invention may be 
used to purify KGF. Thus, it should be tinders tood that 
the terms -keratinocyte growth factor" and "KGF* as 

25 employed in this description ate intended to include, 
and to mean interc h ang eab ly unless otherwise indicated, 
native KGF and KGF analog proteins (or "muteins") 
characterized by a peptide sequence substantially the 
same as the peptide sequence of native KGF and by 

30 retaining some or all of the biological activity of 

native KGF, particularly non- fibroblast epithelial cell 
proliferation (e.g., exhibiting at least about 500-fold 
greater stimulation of BALB/MK keratinocyte cells than 
that of NIH/3T3 fibroblast cells, and at least about 50- 

35 fold greater stimulation of BALB/MK keratinocyte cells 
than for BS/589 epithelial cells or for CC1208 
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epithelial cells, as determined by H- thymidine 
incorporation) . By "characterized by a peptide sequence 
substantially the same as the peptide sequence of native 
KGF" is meant a peptide sequence which is encoded by a 
5 DNA sequence capable of hybridizing to nucleotides 201 
to 684 of SEQ ID NO:l, preferably under stringent 
hybridization conditions. 

The determination of a corresponding amino 
acid position between two amino acid sequences may be 

10 determined by aligning the two sequences to maximize 

matches of residues including shifting the amin o and/or 
carboxyl terminus, introducing gaps as required and/or 
deleting residues present as inserts in the candidate. 
Database searches, sequence analysis and manipulations, 

15 may be performed using one of the well-known and 
routinely used sequence homology/identity scanning 
algorithm programs (e.g., Pearson and Lipman (1988), 
Proc. Natl. Acad. Sci. U.S.A., jL£:2444-2448; Altschul et 
al. (1990), J. Mol. Biol., 215: 403-410; Lipman and 

20 Pearson (1985), Science,' 222:1435 or Devereux et al. 
(1984), Nuc. Acids Res., 12:387-395). 

Stringent conditions, in the hybridization 
context, -will be stringent combined conditions of salt, 
temperature, organic solvents and other parameters 

25 typically controlled in hybridization reactions. 
Exemplary stringent hybridization conditions- are 
hybridization in 4 X SSC at 62-67° C, followed by 
washing in 0.1 X SSC at 62-67° C. for approximately an 
hour. Alternatively, exemplary stringent hybridization 

30 conditions are hybridization in 45-55% formamide, 4 X 
SSC at 40-45°C. [See, T. Maniatis et. al., Molecular 
Cloning (A Laboratory Manual) ; Cold Spring Harbor 
Laboratory (1982), pages 387 to 389]. 

Thus, the proteins include allelic variations, 

35 or deletion(s) , substitution (s) or insertion(s) of amino 
acids, including fragments, chimeric or hybrid molecules 
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of native KGF. One example of KGF includes proteins 
having residues corresponding to Cys 1 and Cys 15 , of SEQ ID 
NO: 2 replaced or deleted, with the resultant molecule 
having improved stability as compared with the parent 
5 molecule (as taught in commonly owned U.S. S.N. 

08/487,825, filed on July 7, 1995). Another example of 
KGF includes charge-change polypeptides wherein one or 
more of amino acid residues 41-154 of native KGF 
(preferably residues Arg 41 , Gin 43 , Lys 55 , Lys 95 , Lys 12 *, 

10 Asn 137 , Gin 138 , Lys 139 , Arg 144 , Lys 14 " 7 , Gin 15 *, Lys 1 " or 

Thr 154 ) are deleted or substituted with a neutral residue 
or negatively charged residue selected to effect a 
protein with a reduced positive charge (as taught in 
commonly owned U.S. S.N. 08/323,337, filed on October 13, 

15 1994) . A still further example of KGF includes proteins 
generated by substituting at least one amino acid having 
a higher loop- forming potential for at least one amino 
acid within a loop- forming region of Asn 13 -5-Hi s 116_ 
Tyr 117 -Asn 118 -Thr 115 of native KGF (as taught in 

20 commonly owned U.S. S.N. 08/323,473, filed on October 13, 
-1994) . - A still yet further example includes proteins 
having one or more amino acid substitutions, deletions 
or additions within a region of 123-133 (amino acids 
154-164 of SEQ ID NO:2) of native KGF; these proteins 

25 may have agonistic or antagonistic activity. 

Specifically disclosed proteins include the 
following KGF molecules (referred to by the residue 
found at that position in the mature protein (minus 
signal sequence) set forth in SEQ ID NO: 2, followed by 

30 that amino acid position in parentheses and then either 
the substituted residue or ■-■ to designate a deletion) : 
C(1,15)S, AN15-AN24, AN3/C(15)S, AN3/C(15)-, £N8/C(15)S, 
AN8/C(15)-, C(1,15)S/R(144)E, C (1, 15) S/R (144) Q, 
AN23/R(144)Q, C(1,15,40)S, C (1, 15 , 102 ) S, 

35 C(l, 15,102, 106)S, AN23/N(137)E, AN23/K(139)E, 

AN23/K(139)Q, AN23/R(144) A, AN23/R(144) E, AN23 /R (144) L, 
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AN23/K(147)E, AN23/KU47) Q, AN23/K(153)E, AN23/K(153)Q, 
AN23/Q(152)E/K(153)E; R(144)Q and H(116)G. 

As those skilled in the art will also 
appreciate, a variety of host-vector systems may be 
5 utilized to express the KGF protein-coding sequence. 
These include but are not limited to eucaryotic cell 
systems such as mammalian cell systems infected with 
virus (e.g., vaccinia virus, adenovirus, etc.); insect 
cell systems infected with virus (e.g., baculovirus) ; 

10 microorganisms such as yeast-containing yeast vectors; 
or to procaryotic cell systems such as bacteria 
transformed with bacteriophage UNA, plasmid DNA, or * 
cosmid DNA. The expression elements of these vectors 
vary in their strengths and specificities . Depending jon 

15 the host-vector system utilized, any one of a number of 
suitable transcription and translation elements may be * 
used. 

Once the protein product of KGF expression has 
been isolated, purified and assayed for KGF activity 

20 (using procedures known to those skilled in the art) , it 
may be formulated in a variety of pharmaceutical 
compositions. Typically, such compositions include a 
suitable*;' usually chemically-defined, carrier or 
excipient for the therapeutic agent and, depending on 

25 the intended form of administration, other ingredients 
as well. The composition can include aqueous carriers 
or consist of solid phase formulations in which KGF is 
incorporated into non-aqueous carriers such as 
collagens, hyaluronic acid, and various polymers. The 

30 composition can be suitably formulated to be 

administered in. a variety of ways, including by 
injection, orally, topically, intranasally and by 
pulmonary delivery. 
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Brief Description of the Drawings 

v 

Figure 1 shows the nucleotide (SEQ ID N0:1) 
and amino acid (SEQ ID N0:2) sequences of native KGF 
5 (the nucleotides encoding the mature form of native KGF 
is depicted by bases 201 to 684 of SEQ ID N0:1 and the 
mature form of KGF is depicted by amino acid, residues 32 
to 194 of SEQ ID NO:2) . 

Figures 2A, 2B and 2C show the plasmid maps of 
10 pCFM115S r * pCFM1656 and pCFM3102 , respectively.-* 

Figure 3 shows the nucleotide (SEQ ID NO: 3} 
and amino acid (SEQ ID NO: 4) sequences of the construct 
RSH-KGF. 

Figure 4 shows the nucleotide (SEQ ID NO: 5) 
15 and amino acid (SEQ ID NO: 6) sequences of the construct 
contained in plasmid KGF. 

Figure 5 shows the chemically synthesized 
OLIGOs (OLIGO#6 through OLIGO#ll; SEQ ID NO:12-17, 
respectively) used to substitute the DNA sequence 
20 between a Kpnl site and an EcoRI site for a Kpnl site 
(from amino acid positions 46 to 85 of SEQ ID No: 6) in 
the construct contained plasmid KGF to produce the 
construct in plasmid KGF(dsd) . 

Figure 6 shows the chemically synthesized 
25 OLIGOs (OLIGO#12 through OLIGO#24; SEQ ID NO:18-30, 
respectively) used to construct KGF(codon optimized) . 

Figure 7 shows the nucleotide (SEQ ID NO: 31) 
and amino acid sequences (SEQ ID NO: 32) of C(1,15)S, a 
KGF analog having substitutions of serine for cysteine 
30 at amino acid positions 1 and 15 of native KGF. 

Figure 8 shows the nucleotide (SEQ ID NO: 33) 
and amino acid sequences (SEQ ID NO : 3 4 ) of 
C(1,15)S/R(144)E, a KGF analog having substi tut ions of 
serine for cysteine at amino acid positions 1 and 15 and 
35 a substitution of glutamic acid for arginine at amino 
acid. position 144 of native KGF. 
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Figure 9 shows the nucleotide {SEQ ID NO:35) 
and amino acid (SEQ ID NO: 36) sequences of 
C(1,15)S/R(144)Q, a KGF analog having -substitutions of 
serine for cysteine at amino acid positions 1 and IS and ;. 
5 a substitution of glutamine f or arginine at amino acid 
position 144 of native KGF. 

Figure 10 shows the nucleotide (SEQ ID NO: 37) 
and amino acid (SEQ ID NO:38) sequences of AN15, a KGF 
analog having a deletion of the first 15 amino acids of 
10 the* N- terminus of native KGF. 

Figure 11 shows the nucleotide (SEQ ID NO: 39) 
and amino acid (SEQ ID NO: 40) sequences of AN23, a KGF 
analog having a deletion of the first 23 amino acids of 
the N- terminus of native KGF. 
15 Figure 12 shows the nucleotide (SEQ ID NO: 41) 

and amino acid (SEQ ID N0:42) sequences of AN23/R(144)Q, 
a KGF analog having a deletion of the first 23 amino 
acids of the N- terminus and a substitution of glutamine 
for arginine at amino acid position 144 of native KGF. 

20 

Description si Specific Endre^iingnt? 

: : Standard methods for many of the procedures 
described in the following examples, or suitable 

25 alternative procedures, are provided in widely 

recognized manuals of molecular biology such as, for 
example, Molecular Cloning, Second Edition, Sambrook et 
al. , Cold Spring Harbor Laboratory Press (1987) and 
Current Protocols in Molecular Biology, Ausabel et al., 

30 Greene Publishing Associates /Wiley- Interscience, New 
York (1990).. 

Examnlg 1 : Preparation of DNA Coding for KGF and KGF Analogs 

35 The cloning of the full-length human KGF gene 

(encoding a polypeptide with the sequence of native KGF) 
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was carried out both by polymerase chain reaction (PCR) 
of RNA from an animal cell and by PCR of chemically 
synthesized {E. coli optimized codon) oligonucleotides 
("OLIGOs") . Both procedures are described below: 
5 PCR amplification using RNA isolated from 

cells known to produce the polypeptide was performed. 
Initially, cells from a human fibroblast cell line 
AG1523A (obtained from Human Genetic Mutant Cell Culture 
Repository Institute For Medical Research, Camden, New 
10 Jersey) were disrupted with guanidium thiocyanate, 
followed by extraction (according to the method of 
Chomyzinski eC al. (1987), Anal. Biochem., 122:156) . 
Using a standard reverse transcriptase protocol for 
total RNA, the KGF cDNA was generated. PCR (PCR#1) 
15 amplification of the KGF gene was carried out using the 
KGF cDNA as template and primers OLIGO#l and 0LIG0#2 
that encode DNA sequences immediately 5 ' and 3* of the 
KGF gene [model 9600 Thermocycler (Perkin-Elmer Cetus, 
Norwalk, CT) ; 28 cycles; , each cycle consisting of one 
20 minute at 94°C for denaturation, two minutes at 60°C for 
annealing, and three minutes at 72°C for elongation]. A 
small aliquot of the PCR#1 product was then used, as 
template for a second KGF PCR <PCR#2) amplification 
identical to the cycle conditions described above except 
25 for a 50°C annealing tenperatute. For expression 

cloning of the KGF gene, nested PCR primers were used to 
create convenient restriction sites at both ends of the 
KGF gene. OLIGO#3 and OLIGO#4 were used to modify the 
KGF DNA product from PCR#2 to include Mini and BamHI 
30 restriction sites at the 5' and 3' ends of the gene, 

respectively [PCR#3; 30 cycles; each cycle consisting of 
one minute at 94°C for denaturation, two minutes at 60°C 
for annealing, and three minutes at 72°C for 
elongation] , This DNA was subsequently cut with Mlul 
35 and BamHI, phenol extracted and ethanol precipitated. 
It was then resuspended and ligated (using T4 ligase) 
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into a pCFM1156 plasmid (Figure 2 A) that contained a 
•RSH* signal sequence to make construct RSH-KGF (Figure 
3) . The ligation products were transformed (accor ding 
to the method of Hanahan (1983), J. Hoi. Biol., 166:557) 
S into E. coli strain FMS (ATCC: 53911) and plated onto 
LB+kanamycin at 28°C. Several transfonnants were 
iselected and grown in small liquid cultures containing 
20 jig/mL kanamycin. The RSH-KGF plasmid was Isolated 
from the cells of each culture and DNA sequenced. 

10 Because of an internal Ndel site in the KGF gene,, it was 
not possible to directly clone the native gene sequence 
into the desired expression vector with the bracketed 
restriction sites of Ndel and BamHI. This was 
accomplished as a three-way ligation. Plasmid RSH-KGF . 

15 was cut with the unique restriction sites of BsmI and 

SstI, and a -3 kbp DNA fragment (containing the 3 ; end • 
of the KGF gene) was isolated -following electrophoresis 
through a 1% agarose gel. A PCR (PCR#4) was carried out 
as described for PCR#3 except for the substitution of 

20 OLIGO#5 for 0LIG0#3. The PCR DNA product was then cut 
with Ndel and BsmI and a 311 bp DNA fragment was 
isolated following electrophoresis through a 4% agarose 
gel. The third fragment used in the ligation was a 1.8 
kbp DNA fragment of pCFM1156 cut with JVdeJ and SstI 

25 isolated following electrophoresis through a 1% agarose 
gel. Following ligation (T4 ligase) , transformation, 
kanamycin selection and DNA sequencing as described 
above, a clone was picked containing the construct in 
Figure 4, and the plasmid designated KGF. Because of an 

30 internal ribosomal binding site that produced truncated 
products, the KGF DNA sequence between the unique Kpnl 
and EcoRI sites^was replaced with chemically synthesized 
OLIGOs (OLIGO#6 through OLIGO#ll) to minimize the use of 
the internal start site (Figure 5) . 
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OLIGOtl (SEQ ID NO:7) : 5 ■ -CAATGACCTAGGAGTAACAATCAAC-3 1 

OLIGO#2 (SEQ ID NO: 8) : 5 ■ -AAAACAAACATAAATGCACAAGTCCA-3 * 

OLIGO#3 {SEQ ID NO: 9 ) : 5 1 -ACAACGCGTGCAATGACATGACTCCA-3 ' 

OLIGO#4 (SEQ ID NO:10) : 
5 5 • -ACAGGATCCTATTAAGTTATTGCCATAGGAA-3 ' 

OLIGO#5 (SEQ ID NO: 11) : 

S ' -ACACATATGTGCAATGACATGACTCCA-3 ■• 
OLIGO#6 (SEQ ID NO: 12) : 

5 ' rCTGCGTATCGACAAACGCGGCAAAGTCAAGGGCACCC - 3 « 
10 OLIGOf 7 (SEQ ID NO:13) : 

5 1 -AAGAGATGAAAAACAACTACAATATTATGGAAATCCGTA C T G TT-3 ' 
OLIGO#8 (SEQ ID NO:14): 

- S ' ~G CTU1U ' G GTATCGTT G CAATCAAAG G1X» 1" 1V AATCTG" > 3 * 
OLIGO#9 (SEQ ID NO:15): 
15 5 * -TCTTGGGTGCCCTTGACTTTKCGCGTTTGTCGATACGCAGGTAC-3 ' 

OLIGOflO (SEQ ID NO: 16): 

5 ' -ACAGCAACAGTACGGATTTCCATAATATTCTAGTTGTTTTTCATC-3 1 
OLIGO#ll (SEQ ID NO : 17 ) : 

5 1 -AATTCAGATTCAACACCTTTGATTGCAACGATACCA-3 • 

20 

The OLIGOs were phosphorylated with T4 
polynucleotide kinase and then heat denatured. The 
single-strandeci (ss) OLIGOs were then allowed to form a 
ds DNA fragment by allowing the tempera ture to slowly 
25 decrease to room temperature. *T4 ligase was then used 
to covalently link both the internal OLIGO sticky-ends 
and the whole ds OLIGO fragment to the KGF plasmid cut 
with Kpnl and EcoKZ. The new plasmid was designated 
KGF(dsd) - 

30 A completely E. coli codon-optimized KGF gene 

was constructed by PCR amplification of chemically 
synthesized OLIGOs #1^ through 24. 
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0LIGO#12 (SEQ ID NO: 18) : 5 ' -AGTTTTGATCTAGAAGGAGG-3 • 
OLIGO#13 (SEQ ID NO : 19 ) : 5 • -TCAAAACTGGATCCTATTAA-3 • 
OLIGO#14 (SEQ ID NO:20) : 

5 ' -A (/lTn * lS ATCTAGAAGGAGGAATAACATATGTGCAACGACATGAC- 
5 TCCGGAACAGATGGCTACCAACGTTAACTGCTCCAGCCCGGAACGT- 3 1 

OLIGOflS (SEQ ID NO: 21) : 

5 * -CACACCCGTAGCTACGACTACATGGAAGGTXKjTGACATCCGTGTTC- 
GTCGTCTGTTC TGCCGTACCCAGTGOTACCTGCGTATCGACAAA- 3 • 
. OLIGO#16 (SEQ ID NO:22) : 
10 5 * -CGTGGTAAAGTTAAAGGTACCCAGGAAATGAAAAACAACTA- 

CAACATCATGGAAATCCCTACTCTrTGCTGTTC 3 '. 

OLIGO#17 (SEQ ID NO:23 ) : 

5 ' -GGTGTTGAATCTGAATTCTACCTGGCAATGAACAAAGAAGGTAAAC - 
TGTACGCAAAAAAAGAATGCAACGAAGACTGCAACTTCAAAGAA- 3 * 
15 OLIGO#18 (SEQ ID N0:24) : 

5 1 -CTGATCCTGGAAAACCACTACAACACCTACGCATCTGCTAAATGGA- . 
CCCACAACGGTGGTGAAATGTTCGTTGCTCTGAACCAGAAAGGT- 3 ' 
OLIGOil9 (SEQ ID NO:25) : 

5 ' -ATCCCGGTTCGTGGTAAAAAAACCAAAAAAGAACAGAAAACCGCT- 
20 CACTTCCTGCCGATGGCAATCACTTAATAGGATCCAGTTTTGA-3 ■ 

OLIGO#20 (SEQ ID NO:26) : S ' -TACGGGTGTGACGTTCCGGG-3 * 
OLIGO#21 (SEQ ID NO : 27 ) : 5 * -CTTTACCACGTTTGTCGATA-3 1 
OLIGO#22 : ( SEQ ID NO : 28 ) : 5 ' - ATTCAACACCTTTGATTGCA- 3 ' 
OLIGO#23 (SEQ ID NO : 29 ) : 5 ' -CCAGGATCAGTTCTTTGAAG-3 1 
25 OLIGO#24 (SEQ ID NO:30) : 5 1 «rGAACCGGGATACCTTTCTGG-3 * 

OLIGOs #12 through 24 were designed so that 
the entire DNA sequence encoding native KGF was 
represented by OLIGOs from either the "Watson- or the 

30 •Crick" strand and upon PCR amplification would produce 
the desired double- stranded DNA, sequence (Figure 6) 
[PCR#5, Model 9OT0 thennocycler ( Perkin-Elmer Cetus) ; 21 
cycles, each cycle consisting of 31 seconds at 94°C for 
denaturation, 31 seconds at 50°C for annealing, and 31 

35 seconds at 73°C for elongation; following the 21 cycles 
the PCR was finished with a final elongation step of 7 
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minutes] . After PCK amplification, the DNA fragment was 
cut with. Xbal and BamHT and the 521 bp fragment ligated . ■ 
into the expression plasmid pCFM115 6 cut with the same 
enzymes. PCR#5 utilized the outside primers 
5 (100 pmoles/100 yl ran) OLIGO#12 and OLIGO#13 and 
. 1 pl/100 jil rxn of a KGF template derived by ligation 
(by T4 ligase) of OLIGOs #14 through #19 (OLIGOs#15 
through OLIGOs#18 were phosphorylated with T4 
polynucleotide kinase) using OLIGOs #20 through OLIGOs #2 4 

10 as band-aid oligos (Jayaraman et al. (1992), 

Bioteclmiques, 12:392) for the ligation. The final 
construct was designated KGF (codon optimized) . 

All of the KGF analogs described herein are 
composed in part from DNA sequences found in KGF(dsd) or 

15 • KGF (codon optimized), or a combination of the two. The 
-/ sequences '. are further modified by the insertion into 
convenient- restrictions sites of DNA sequences that 
encode the particular KGF analog amino acids made 
utilizing one or more of the above-described techniques . 

20 for DNA. fragment synthesis. Any of the analogs can be 
generated in their entirety by either of the above 
described techniques. However, as a part of the general 
OLIGO design optimized E. coli codons were used where 
appropriate, although the presence of E. coli optimized 

25 codons in part or in toto of any of the genes where 
examined did not significantly increase the yield of 
protein that could be obtained from cultured bacterial 
cells. Figures 7 to 12 set forth by convenient example 
particular KGF analog nucleotide and amino acid sequence 

30 constructions: C(1,15)S (Figure 7); C(1,15)S/R(144)E 
(Figure 8); C(l, 15)S/R(144)Q (Figure 9); AN15 (Figure 
10); AN23 (Figure 11) and AN23/R(144)Q (Figure 12). All 
the KGF analog constructions described herein were DNA 
sequence confirmed. 
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T^Trmle 2 : Purification from E. coli 

Three different expression plasmids were 

5 utilized in the cloning of the KGF analog genes. They 

were pCFM1156 (ATCC 69702), pCFM1656 (ATCC 69576), and 

pCFM3102 (Figures 2A, 2B and 2C, respectively) . The;; 

plasmid p3102 can be derived from the plasmid pCFM1656 

by making a series of site directed base changes with 

10 PCR. "overlapping oligo mutagenesis. Starting with the 

Bglll site (pCFMl 65 6 plasmid bp #180) immediately 5* to 

the plasmid replication promoter, PcopB/ proceeding 

toward the plasmid replication genes, the base pair 

changes are fits follows: 
15 ' 
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pfWIfigg bn # tan in nTyM1656 tan rhangpd fca in nfP»m m 







# 


204 


T/A 


C/G 






# 


428 


A/T 


G/C 






# 


509 




A/ 1 


5 




# 


617 




insert two G, 






# 


677 


G/C 


T/A 






# 


978 


rp /TV 
1/ A 








# 


992 


//■» 


A/ 1 






# 


1002 


A/ 1 




10 




* 


1005 


C/G 


T/A 






# 


1026 


A/T 


T/A 






# 


1045 


C/G 


T/A 






* 


1176 


G/C 


T/A 






# 


1464 ' 


G/C 


T/A 


15 




# 


2026 


G/C 


bp deletion 






# 


2186 


C/G 


T/A- 






# 


2479 


A/T 


T/A 




# 


2498-2501 


AGTG 


GTCA 


20 








. TCAC 


CAGT 




# 


2641-2647 


TCCGMC 


bp deletion 










AGGCTCG 




25 


# 


3441 


G/C 


A/T 




# 


3452 


G/C 


A/T 




# 


3649 


A/T 


T/A 



# 4556 — insert bps 

30 (SEQ ID NO: 43) 5 ' -fi ftry^AOTAC?I^GACCTGCAG-3 ' - 

( SEQ ID NO : 44 ) ' 5 1 -CTCGAGTGATCACAGCTGGACGTC - 3 * 



- As seen above, pCFM1156, pCFM1656 and pCFM3102 
are very similar to each other and contain many of the 

35 same restriction sites. The plasmids were chosen by 

convenience, and the vector DNA components can be easily 
exchanged for purposes of new constructs. The host used 
for all cloning was E. coli strain FM5 (ATCC: 53911) and 
the transformations were carried out (according to the 

40 method of Hanahan (1983), supra) or by electro elut ion 
with a Gene Pulser™ transfection apparatus (BioRad 
Laboraties, Inc., Hercules, CA) , according to the 
manufacturer's protocol. 

Initially, a small, freshly-cultured inoculum 

45 of the desired recombinant E. coli clone harboring the 



WO 1552 PCT/US95O3099 

- 18 - 

desired construct on one o$ the three pCFM vectors was 
started by transferring 0.1 mL of a frozen glycerol 
stock of the appropriate strain into a 2 L flask 
containing 500 mL of Luria broth. The culture was 
5 shaken at 30 *C for 16 hours. Thereafter the culture was 
transferred to a IS L f ermentor containing 8 L of 
sterile batch medium (Tsai, et al. , (1987) J. Industrial 
Microbiol., 2:181-187). 

Feed batch fermentation starts with the 

10 feeding of Feed # 1 medium (Tsai, ec al. (1987.), 

supra) . When the OD600 reached 35, expression of the 
desired KGF analog was induced by. rapidly raising the 
culture temperature to 37'C to allow the amplification 
of plasmid. After two hours at 37 *C, the culture 

15 temperature was quickly raised to 42 # C to denature the 

CI repressor and the addition of Feed 1 was discontinued* 
in favor of Feed 2, the addition rate of which was 
initiated at 300 mL/hr. Feed 2 comprised 175 g/L 
" trypticase-peptone, 87.5 g/L yeast extract, and 260 g/L 

20 glucose. After one hour at 42 "C, the culture 
temperature was decreased to 36 "C, where this 
. temperature was then maintained for another 6 hours . 

.vThe fermentation was then halted and the cells 
were harvested by centrifugation into plastic bags 

25 placed within 1 L centrifuge bottles. The cells were 
pelleted by centrifugation at 400 rpm for 60 minutes, 
after which the supernatants were removed and the cell 
paste frozen at -90 *C. . 

Following expression of the various KGF 

30 analogs in E. coli, native KGF, C(1,15)S, 

C(1,15)S/R(144)E, C(1,15)S/R(144)Q, AN15, M23, and 
AN23/R(144)Q protein were purified using the following 
procedure. Cell paste from a high cell density 
fermentation was suspended at 4*C in 0.2 M NaCl, 20 mM 

35 NaP04, pH 7.5 as a 10-20% solution (weight per volume) 
using a suitable high shear mixer. The suspended cells 
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were then lysed by passing the solution through a 
homogeni zer (AFV Gaulin, Inc., Everett/ MA) three times. 
The outflowing homogenate was cooled to 4-8 *C by using a 
suitable heat exchanger. Debris was then removed by 
5 centrifuging the lysate in a J-SB 01 centrifuge (BecJcnan - 
Instruments, Inc., Brea, CA) equipped with a JS 4.2 
rotor at 4,200 rpm for 30-60 min. at 4 # C. Siipernatants 
were *">>*™ carefully decanted and loaded onto a 
previously prepared 450 mL(5 cm x 23 cm) column of S- 

10 Sepharose Fast Flow 1 * resin (Pharmacia) column 

equilibrated with 0.2 M NaCl, 20 mM NaP0 4 , pH 7.5 at 
4*G. Next, the column, was washed with five column 
volumes (2250 mL) of 0.4 M NaCl, 20 mM NaP0 4 , pH 7.5 at 
4*C. The desired protein was eluted by washing the 

15 column with 5 L of 0.5 M NaCl, 20 mM NaP0 4 , pH 7.5. 

Again, 50 mL fractions were collected and the A28O of , the 
effluent was continuously monitored. Fractions 
identified by A28O as containing eluted material were 
, then analyzed by SDS-PAGE through 14% gels to confirm 

20 the presence of the desired polypeptide. 

Those fractions containing proteins of 
interest were th *n pooled, followed by the addition of 
an equal volume of distilled water. The diluted sample 
was then loaded onto a previously prepared 450 mL (5 an 

25 x 23 cm) column of S-Sepharose'Fast Flow equilibrated 

with 0.4 M NaCl, 20 mM NaP0 4 , pH 6.8 at 4*C. The column 
was washed with 2250 mL of 0.4 M NaCl, 20 mM NaP0 4 , pH 
6.8 and the protein eluted using a 20 column volume 
linear gradient ranging from 0.4 M NaCl, 20 mM NaP0 4 , pH 

30 6.8 to 0.6 M NaCl, 20 mM NaP0 4 , pH 6.8. Again, 50 mL 
fractions were collected under constant. A28O monitoring 
of the effluent. Those fractions containing the protein 
(determined by 14% SDS-PAGE) were then pooled, followed 
by concentration through a YM-10 membrane (10,000 

35 molecular weight cutoff) in a 350cc stirring cell 

(Amicon, Inc. Mayberry, MA) to a volume of 30-40 mL. 
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The concentrate was then loaded onto a 
previously generated 1,300 ml# (4.4 cm x 85 an) column of 
Superdex-75™ resin (Pharmacia) equilibrated in col umn 
buffer comprising IX PBS (Dulbecco's Phosphate Buffered 
5 Saline, *D-PBS, ■ calcium and magnesium- free) or 0.15 M 
NaCl, 20 mM NaP04, pH 7.0. After allowing the sample to 
run into the column, the protein was eluted from the gel 
filtration matrix using column buffer. Thereafter, 
10 mL fractions were recovered and those containing the 

10 analog (determined by 14% SDS-PAGE) were pooled. 

Typically, the protein concentration was about 5-10 
mg/mL in the resultant pool. All of the above 
procedures were performed at 4-8 *C, unless otherwise 
specified. . 

15 An alternative purification procedure was used 

to purify native KGF, C(1,15)S and AN23. The procedure - 
involves the following steps and, unless otherwise 
specified, all procedures, solutions and materials were 
conducted at 23 ± 5*C. 

20 Upon completion, of the production phase of a 

bacterial fermentation, the cell culture was .cooled to 
4-8 *C and the cells were harvested by centrifugation or 
a similar- process. On the basis of the expected yield 
of protein per unit weight of cell paste and the amount 

25 of purified protein required, an appropriate amount of 
cell paste, by weight, was suspended in a mild buffer 
solution, 20 mM NaP0 4 , 0.2 M NaCl, pH*7.5, weighing 
about five times that of the cell paste to be suspended. 
The cells were dispersed to a homogeneous- solution using 

30 a high shear mixer. The temperature of the cell paste 
dispersion was maintained at 4-8 *C during 
homogeni za t ion . 

The cells were then lysed by pressure, for 
example by passing the cell paste dispersion twice 

35 through an appropriately sized cell homogenizer. The 
homogenate was kept chilled at 5 i 3°C. To clarify the 
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cell lysate, a previously prepared depth filter housing 
(Cuno, Inc., Meriden, CT) equipped with a filter having 
an appropriate amount of filter surface area, 
equilibrated with a suitable volume of 0.2 M NaCI, 20 mM 
5 NaP04, pH 7.5 was employed. The equilibration and 
clarification were performed at 5 + 3*C. Prior to 
clarification, an appropriate amount of a suitable 
filter aid was used to pre-coat the filter and be 
thoroughly mixed with the cell lysate, after which the 

10 lysate was clarified by passing the solution through the 
filter apparatus. The filter was washed with 0.2 M 
NaCI, 20 mM NaPO^ pH 7.5. The filtrate and any 
subsequent wash were collected in a chilled container of 
suitable capacity, all the while being m ai n tained at 

15 less than 10 *C. 

Following clarification the lysate was then 
passed through a previously prepared column of 
SP-Sepharose Fast Flow containing at least 1 mL of resin 
per 2 g of cell paste. The column of SP-Sepharose Fast 

20 Flow was equilibrated with cold (5 ± 3'C), 0.2 M NaCI, 
20 mM NaP04, pH 7.5. The temperature of the column was 
maintained at less than 10 *C. The clarified lysate 
(5 ± 3*C) was ^en loaded onto the ion exchange column, 
with the absorbance at 280 nm {A280) of eluate being 

25 continuously monitored. After 1 sample loading, the 
column was washed with cold 0.2 M NaCI, 20 mM NaP04, 
pH 7.5, followed by washing with 0.3 M NaCI, 20 mM . 
NaP04, pH 7.5 at 23 ± 5*C. 

To elute the desired protein, a linear 

30 gradient ranging from 0.2-1 M NaCI, 20 mM NaF04, pH 7.5 
was used. Bulk product was collected in several 
fractions on the basiV of the A28O of the eluate. 
Following elution, these fractions were pooled and the 
volume noted. 

35 To oxidize free sulfhydryl groups, an 

oxidation step was performed. For proteins with altered 
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cysteine patterns, as compared to native KGF, an 
oxidizing agent (e.g., cystamine dihydrochloride or 
another appropriate oxidizing agent, for instance, 
cystine, oxidized glutathione or divalent copper) was 
5 added to a final concentration of 1-20 mM and the pH was 
adjusted to 7-9.5", with a pH of 9.0 ±0.3 when cystamine 
dihydrochloride was used. The oxidation was conducted 
at 10 - 30 # C for an appropriate period. For the native 
KGF protein, oxidation was accomplished by adding an 

10 appropriate amount of (NH4)2S0 4 such as 1-2 M (NH4)2SQ4, 
adjusting the pH to 7.5-9.5, and holding the temperature 
at 23 + 5'C for an appropriate period. 

After oxidation, the pH, of the solution was 
adjusted to between 6.5 and 9.5. If necessary, solid . 

15 (NH4)2S04 was added to the solution to a final 

concentration of 2 M. To remove particulates, the 
solution was passed through appropriate clarification 
filters. 

The filtered, oxidized product was then 

20 subjected to hydrophobic interaction chromatography 

(HIC) . The HIC matrice was Butyl-650M Toyopearl» resin 
(Tosohaas , Inc. , Montgomeryville, PA) . The protein- 
containing solution was loaded onto the column, which 
had been previously equilibrated with 2 M (NH4)2S04, 0.15 

25 M NaCl, 20 mM NaP04, pH 7-.0. After sample loading, the 
column was washed with 2 M (NH4)2S04, 0.15 M NaCl, 20 mM 
NaP0 4 , pH 7.0. The desired protein was then eluted 
using a decreasing linear (NH4) 2 SO 4 gradient ranging from 
2-0 M developed in 0.15 M NaCl, 20 mM NaP04, pH 7.0. 

30 When the desired protein began to elute, as indicated by 
an increase in the A 2 80 of the eluate, fractions were 
collected. Ali^Hots of each fraction were then analyzed 
by SDS-PAGE. Those fractions containing the desired 
protein were then pooled, thoroughly mixed, and the 

35 volume of the pool determined, as was the concentration 
of the protein therein. 
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The pooled HIC protein-containing eluate was 
then concentrated and the elution buffer exchanged. 
Typically, proteins were concentrated to 5.0-10.0 mg/xnL. 
Ultrafiltration was conducted using an ultrafiltration 
5 system equipped with a Pellicotf* cassette system 

(Millipore, Inc., Bedford, MA) with an appropriately 
sized cut-off membrane 

After concentration, the sample was 
diafiltered against an appropriate buffer. The 
10 retentate from the concentration step was diafiltered 
against 0.15 M NaCl, 20 mM NaP04, pH 7.0 until the 
conductivity of the retentate was within 5% of the 
conductivity of the 0.15 M NaCl, 20 mM NaP0 4 , pH 7.0 
solution. 

15 In addition, to remove precipitates and 

bacterial endotoxin that might be present, the 
concentrated diafiltered protein-containing sample was 
passed through a 0.1 pm Posidyne» filter (Pall, Inc., 
Cortland, NY) . After determining the protein 

20 concentration of the solution and on the basis of the 
desired concentration of the final, bulk product, the 
solution was diluted with 0.15 M NaCl, 20 mM sodium 
phosphate, pH 7.0, to the desired final concentration. 
A final aseptic filtration through a 0.22 pm filter, was 

25 then performed as the final bulk product was transferred 
to a pyrogen- free container for storage (at about 5* C) 
for further formulation. 



Example 3: Purification from Mammalian Cell Culture 

30 

This example describes the expression, 
isolation, and characterization of two biologically 
active recombinant KGF (rKGF) forms produced in a 
ma mma lian expression system. 
35 The human KGF gene was isolated by PGR 

amplification of cBNA made from normal dermal human 
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fibroblast cells (Clonetec, Inc., Palo Alto, CA). 
Following the making of cDNA by reverse transcriptase, 
PCR was used to amplify the KGF gene. 0LIGO#2S and 
OLIGO#26 were used to amplify the gene out of the cDNA 
and OIiIGO#27 and OLIGO#28 were used to place HindTII and 
Bglll restriction sites at the fragment ends by a second 
PCR amplification, as set forth in Figure 1. 



0LIG0#25 (SEQ ID NO: 45) 

0LIG0#26 (SEQ ID NO: 46) 

OLIGO#27 (SEQ ID NO: 47) 

OLIGO#28 (SEQ ID NO: 48) 



5 1 -CAATCTACAATTCACAGA-3 '. 

5 ' -TTAAGTTATTGCCATAGG-3 1 

5 ' - AACAAAGCTTCTACAATTCACAGATAGGA- 3 1 

5 • - AACAAGATCTTAAGTTATTGCCATAGG- 3 * 



Following clo nin g and DNA sequence 
confirmation, the KGF gene DNA was then used. 
Amplif ication was effected using two primers : 



0LIGO#29 (SEQ. ID. N0:49): 

5 • -CGGTCTAGACCACCATGCACAAATGGATACTGACATGG-3 1 

20 OLIGOi30 (SEQ. ID. NO:50) : 

5 • -GCCGTCGACCTATTAAGTTATTGCCATAGGAAG-3 ■ 

The sense primer, OLIGO#29, included an Xbal 
site and 1 a consensus Kozak translation sequence (5'- 

25 CCACC-3') upstream of the start codon, ATG. The 

antisense primer, OLIGO#30, included a Sail cloning 
site and an additional stop codon. After 18 cycles of 
PCR amplification (30 sec. denaturation at 94*C, 40 sec. 
annealing at 55 # C, and 40 sec. elongation at 72 *C), the 

30 product was digested with Xbal and Sail and ligated with 
a similarly digested DNA. of pDSRa2 (according to the 
methods of Bourdrel et al. (1993), Protein Exp. & 
Purif.. 4:130-140 and Lu et al. (1992), Arch. Biochem. 
Biophys., 22£:150-158) . This resulted in plasmid 

35 KGF/pDSRa2 which placed the human KGF gene between the 
SV40 early promoter and the a-FSH polyadenylation 



WOK/11932 PCT/US95/13099 

- 25 - 

sequences . Two clones were picked and DNA sequence 
analysis confirmed construction of the desired vector. 

Two micrograms of KGF/pDSRa2 DNA were then 
linearized with Pvul. Chinese hamster ovary (CHO) 
5 . cells, seeded the day before at 0.8 x 10* cells/60 mm 
culture dish, were then transfected with the treated DNA 
using a . standard calcium phosphate precipitation method * 
(Bourdrel et al., supra) . Two weeks later, individual 
colonies were picked and transferred into 24-well 
10 .plates. The conditioned media was considered serum- free 
when the cells reached conf luency and aliquots thereof 
were analyzed by Western blotting using a polyclonal 
rabbit antiserum reactive against E. coli-expressed 
human KGF. 

15 Westerns were performed by running samples 

through 12.5% (w/v) SDS polyacrylamide gels, followed by 
electroblotting for 1 hr. at 400 mA onto nitrocellulose 
membranes using a semidry transfer apparatus (Hoefer 
Scientific Instruments, San Francisco, CA) . 20 mM Tris, 

20 150 mM glycine, 20% methanol served as the transfer 
buffer. The nitrocellulose sheets were blocked by 
incubation with 10% normal goat serum in PBS. Rabbit 
anti-serum raided against E. coli-derived KGF was used 
as primary antibody. For use, it was diluted 1/10,0.00 

25 in 1% normal goat serum in PBS 'and incubated with the 
blocked nitrocellulose sheets for 12 hr. at room 
temperature, after which excess antibody was removed by 
three 30 min. washes in PBS. The nitrocellulose 
membranes were then incubated in 100 mL of 1% normal 

30 goat serum in PBS containing Vectastain w biotinylated 
goat anti-rabbit IgG (secondary antibody, Vector Labs, 
Burlingame, CA) , for 30 minutes at room temperature. 
After three 10 minute washes in PBS, a 30 minute room 
. temperature incubation was performed in a 100 mL 

35 solution of 1% normal goat serum containing streptavidin 
and biotinylated peroxidase, prepared according to 
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manufacturer's directions (Vector Labs), Following 
three washes in PBS, KGF cross -reactive material was 
visualized by incubation in a mixture of 60 jiL of 30% 
(w/v) H2O2 in 100 mL of PBS and 50 mg of 4-chloronapthol 
5 in 20 mL of methanol . The reaction was stopped by 
rinsing in water after 10 minutes. 

Analysis of the blots revealed that the KGF- 
specific antibody associated with three distinct protein 
bands, two being closely related with molecular weights 

10 of about 25-29 kDa and one with an estimated molecular 
weight of about 17 kDa,- as compared to the expected 
molecular weight of approximately 18.8 of the 163 amino 
acid mature protein. Additionally, several high 
expressing clones secreting more than 2.0 mg of rKGF per 

15 liter, as judged by Western analysis, were selected and 
expanded into roller bottles (according to the method of 
Lu et al. t supra) to generate large volumes of serum- 
free conditioned medium for purification of KGF by 
cationic exchange chromatography and gel filtration, as 

20 set forth below. 

KGF from 3 L of serum- free conditioned medium 
was purified applying the medium directly to a cation 
exchange : column (5 x 24 cm) packed with 450 mL of 
sulfoethyl column of SP-Sepharose Fast Flow (Pharmacia) 

25 pre-equilibrated with 20 mM sodium phosphate, pH 7.5. 
After w ashin g with five column volumes of 20 mM so d i um 
phosphate, 0.2 M NaCl, pH 7.5, rKGF was eluted using a 
20 column volume linear gradient of 0.2 to 1.0 M NaCl in 
20 mM sodium phosphate, pH 7.5. 50 mL fractions were 

30 collected with continuous A 2 ao monitoring. KGF protein 
was detected by analyzing aliquots" of each fraction by 
SDS-PAGE. SDS-PAGE was performed on an electrophoresis 
system (Novex, San Diego, CA) using precast 14% 
Tris -glycine precast gels (according to the method of 

35 Laemmli (1970), Nature, 222:680-685). Samples were 
mixed with non-reducing SDS sample buffer without 
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heating before loading. The proteins were detected by 
either Coomassie blue or silver staining. Two late- 
eluting peaks were seen to contain protein bands 
corresponding to the 25-29 kDa and 17 kDa bands detected 
5 by Western blot. The fractions containing each of these 
peaks were separately concentrated to a volume of less 
than 1.0 mL and subjected to gel filtration. 

The gel filtrations employed columns of 
SuperdexT75*" resin (HR 10/30, Pharmacia) pre- 

10 equilibrated with PBS, pH 7.2, and calibrated with the 
following known molecular weight standards (BioRad, San 
Francisco, CA) : thyreoglobulin (670 kDa) , gamma globulin 
(158 kDa), ovalbumin (44 kDa), myoglobin (17 kDa) and 
vitamin B-12 (1.4 kDa) . These purification steps 

15 resulted in an approximate 2000-fold purification of 
rKGF, specifically including a 17 kDa and a 30 kDa 
material, as estimated by silver staining. 

In the instance of the higher molecular weight 
. material, rKGF eluted as a major symmetrical peak, which 

20 was called KGF-a. Upon SDS-PAGE analysis of a lesser 

amount of this material, 3 jig/lane versus 6 ^g/lane, two 
bands with a 1-2 kDa molecular weight difference were 
resolved. In the instance of the lower molecular weight 
material, termed KGF-b, gel filtration resulted in a 

25 protein preparation having the expected mobility. For 
both KGF-a and KGF-b, the overall yield after 
purification was approximately 30-40%. 

Amino acid sequences from KGF-a and KGF-b were 
also analyzed. These analyses were performed on an 

30 automatic sequencer (Model 477A or 470A, Applied 

Biosystems, Inc., Foster City, CA) equipped with a Model 
12 OA on-line PTH-amino acid analyzer and a Model 900A 
data collection system (according to the method of Lu ec 
al. (1991), J. Biol. Chem., 2fifi: 8102-8107 ) . Edman 

35 sequence analysis of KGF-a revealed a major N- terminal 
sequence of Xi-N-D-M-T-P-E-Q-M-A-T-N-V-X 2 -X 3 -S- (SEQ ID 
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NO: SI) • A minor sequence s.tarting from the thi rd N- 
terminal amino acid, aspartic acid, was also present in 
1.6% of the total sequenceable protein. X x , X 2 > and X3 
were the unassigned due to the absence of 
5 phenyl thiohydantoinyl (PTH) amino acid signals during 
sequence analysis. . 

Interestingly, N- terminal sequence analysis of 
KGF-b revealed an N-terminal amino acid sequence of 
S-Y-D-Y-M-E-G-G-D-I-R-V- (SEQ ID NO: 52), indicating that 
' . 10 it is an N- terminally truncated form of KGF that has 

been proteolytically cleaved at the Arg 23 -Ser 24 peptide 
bond. 

To further characterize purified KGF-a and 
KGF-b, the protein was subjected to glycosidases » 

15 (neuraminidase, O-glycanase, and/or N-glycanase) , using 
known techniques (Sasaki et al. (1987), *7. Biol. Chem. > \ 
262:12059-12076; Takeuchi et al. (1988), J. Biol. Chem. , 
251:3657-3663; Zsebo et al. (1990) , Cell, £1:195-201) . 
These data indicate that KGF-a contains N- and 0-linked 

20 carbohydrates, although the lower molecular weight form 
of KGF-a probably contains only N-linked sugar. 
Glycosidase treatment did not cause molecular weight 
reduction* for KGF-b, indicating that the molecule is 
unglycosylated . 

25 ' 

Example 4 : Biological Activity 

Each KGF analog was diluted and assayed for 
biological activity by measuring the [ 3 H] -thymidine 

30 uptake of Balb/MK cells (according to the method of 
Rubin et al. (1989), supra) . The samples were first 
diluted in a bio&ssay medium consisting of 50% customer- 
made Eagle's MEM, 50% customer-made F12, 5 yg/mL 
transferrin, 5 ng/mL sodium selenite, 0.0005% HSA and 

35 0.005% Tween 20. KGF samples were then added int 
Falcon primeria 96-well plates seeded with Balb/MK 
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cells. Incorporation of [3h] -Thymidine during DNA 
. synthesis was measured and converted, to input native KGF 
concentration by comparison to a native KGF standard 
curve. Each of the tested analogs exhibited mitogenic 
5 activity. 

Interaction with the KGF receptor was examined 
using isolated KGF receptor membrane preparations 
prepared from Balb/MK mouse epidermal keratinocytes (by 
the procedure described by Massague (19932), J\ Biol, 

10 Cfaem., 25£: 13614-13620) . Specifically, various forms of 
KGF were diluted with 50 mM Tris-HCl, pH 7 . 5 , containing 
0.2% bovine serum albumin so as to range in 
concentration from 0.8 ng to 100 ng per 50 jiL. They 
were individually incubated with the membrane 

15 preparation (75 ng/mL) and 125 I-labeled E* coli-derived 
KGF (1.5 ng) . Receptor binding and competition 
experiments were performed at 4*C for 16 hr. , after 
which - time samples were taken, centrifuged, and washed 
twice with the above diluent buffer to remove unbound 

20 and non-specif ically bound, labeled KGF. Samples were 
then counted for the remaining radioactivity. 
Competition curves for receptor binding between KGF 
samples and labeled KGF were constructed by plotting 
percent uncompetition versus concentrations of each KGF 

25 sample. Radioreceptor assay uncompetition experiments 
indicated that E. coli-derived KGF, KGF-a, and KGF-b 
have similar receptor binding activity. 

While the present invention has been described 
above both generally and, in terms of preferred 

30 embodiments, it is understood that other variations and 
modifications will occur to those skilled in the art in 
light of the description above. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: Amgen Inc. 



(ii) TITLE OF INVENTION: Method for Purifying Keratinocyte 

Growth Factors 

(iii) NUMBER OF SEQUENCES: 52 

(iv) CORRESPONDENCE ADDRESS : - 

(A) ADDRESSEE: Amgen Inc. 

(B) STREET: 1840 DeHavilland Drive 

(C) CITY: Thousand Oaks 

(D) STATE: California 

(E) COUNTRY: U.S.A. 
<F> ZIP: 91320-1789 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.25 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/487,830 

(B) FILING DATE: 

(C) CLASSIFICATION : not yet known 



(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 862 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDH3NESS : unknown. 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

CAATCTACAA TTCACAGATA GGAAGAGGTC AATGACCTAG GAGTAACAAT CAACTCAAGA 60 

TTCATTTTCA TTATGTTATT CAT^AACACC CGGAGCACTA CACTATAATG CACAAATGGA 120 

TACTGACATG GATCCTGCCA ACTTTGCTCT ACAGATCATG CTTTCACATT ATCTGTCTAG 180 

TGGGTACTAT ATCTTTAGCT TGCAATGACA TGACTCCAGA GCAAATGGCT ACAAATGTGA " 240 

ACTGTTCCAG CCCTGAGCGA CACACAAGAA GTTATGATT A CATGGAAGGA GGGGATATAA 300 
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GAGTGAGAAG ACTCTTCTGT CGAACACAGT GGTACCTGAG GATCGATAAA AGAGGCAAAG 360 

TAAAAGGGAC CCAAGAGATG AAGAATAATT ACAATATCAT GGAAATCAGG ACAGTGGCAG 420. 

TTGGAATTGT GGCAATCAAA GGGGTGGAAA GTGAATTCTA TCTTGCAATG AACAAGGAAG 480 
GAAAACTCTA TGCAAAGAAA GAATGCAATG AAGATTGTAA CTTCAAAGAA CTAATTCTGG . 540 

AAAACCATTA GAACACATAT GCATCAGCTA AATGGACACA CAACGGAGGG GAAATGTTTG 600 

TTGCCTTAAA TCAAAAGGGG ATTCCTGTAA GAGGAAAAAA AACGAAGAAA GAACAAAAAA 660 

CAGCCCACTT TCTTCCTATG GCAATAACTT AATTGCATAT GGTATATAAA GAACCCAGTT 720 

CCAGCAGGGA GATTTCTTTA AGTGGACTGT TTTCTTTCT T CTCAAAATTT TCTTTCCTTT 780 

TATTTTTTAG TAATCAAGAA AGGCTGGAAA AACTACTGAA AAACTGATCA AGCTGGACTT 840 

GTGCATTTAT GTTTGTTTTA AG ' 862 

(2) .INFORMATION FOR SEQ ID NO:2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LQIGTH: 194 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 
* (D) TOPOLOGY: unknown 

<ii) MOLECULE TYPE: protein 



.(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 

Met His Lys Trp lie Leu Thr Trp He Leu Pro Thr Leu Leu Tyr Arg 
1 5 _ 10 IS 

Ser Cys Phe His He lie Cys Leu Val Gly Thr He Ser Leu Ala Cys 
20 25 30 

Asn Asp Met Thr Pro Glu Gin Met Ala Thr Asn Val Asn Cys Ser Ser 
35 40 45 

Pro Glu Arg His Thr Arg Ser Tyr Asp Tyr Met Glu Gly Gly Asp He 
50 55 60 

Arg Val Arg Arg Leu Phe Cys Arg Thr Gin Trp Tyr Leu Arg He Asp 
65 . 70 75 80 

Lys Arg Gly Lys Val Lys Gly Thr Gin Glu Met Lys Asn Asn Tyr Asn 
85 90 95 

He Met Glu He Arg Thr Val Ala Val Gly He Val Ala He Lys Gly 
100 105 .110 

Val Glu Ser Glu Phe Tyr Leu Ala Met Asn Lys Glu Gly Lys Leu Tyr 
115 120 125 
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Ala Lys Lys 
130 



Glu Cys Asn Glu Asp 
135 



Cys Asn Phe Lys Glu Leu lie Leu 



140 



Glu Asn His 
145 



Tyr Asn Thr Tyr Ala 
150 



Ser Ala Lys Trp Thr His Asn Gly 




Gly Glu Met 



Phe Val Ala Leu Asn 
165 



Gin Lys Gly lie Pro Val Arg Gly 
170 - 175 



Lys Lys Thr 



Lys Lys Glu Gin Lys 
180 



Thr Ala His Phe Leu Pro Met Ala 
185 190 



lie Thr 



(2) INFORMATION FOR SEQ ID NO: 3: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 595 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: unknown ' 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3 : 

ATCGATTTGA TTCTAGAAGG AGGAATAACA TATGAAAAAG CGCGCACGTG CTATCGCCAT 60 

TGCTGTGGCT CTGGCAGGTT TCGCAACTAG TGCACACGCG TGCAATGACA TGACTCCAGA 120 

GCAAATGGCT ACAAATGTGA ACTGTTCCAG CCCTGAGCGA CACACAAGAA GTTATGATTA 180 

CATGGAAGGA GGGGATATAA GAGTGAGAAG ACTCTTCTGT CGAACACAGT GGTACCTGAG .240 

GATCGATAAA AGAGGGAAAG TAAAAGGGAC CCAAGAGATG AAGAATAATT ACAATATCAT 300 

GGAAATCAGG ACAGTGGCAG TTGGAATTGT GGCAATCAAA GGGGTGGAAA GTGAATTCTA 360 

TCTTGCAATG AACAAGGAAG GAAAACTCTA TGCAAAGAAA GAATGCAATG AAGATTGTAA 420 

CTTCAAAGAA CTAATTCTGG AAAACCATTA CAACACATAT GCATCAGCTA AATGGACACA 480 

CAACGGAGGG GAAATGTTTG TTGCCTTAAA TCAAAAGGGG ATTCCTGTAA GAGGAAAAAA 540 

AACGAAGAAA GAACAAAAAA CAGCCCACTT TCTTCCTATG GCAATAACTT AATAG 595 
(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 186 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SSQ ID NO: 4: 

Met Lys Lys Arg Ala Arg Ala lie Ala He Ala Val Ala Leu Ala Gly 
1 5 10 15. 

Phe Ala Thr Ser Ala His Ala Cys Asn Asp Met Thr Pro <31u Gin Met 
20 25 30 

Ala Thr Asa Val Asn Cys . Ser Ser Pro Glu Arg His Thr Arg Ser Tyr 
35 40 45 

Asp Tyr Met Glu Gly Gly Asp He Arg Val Arg Arg Leu Phe Cys Arg 
50 55 60 

Thr Gin Trp Tyr Leu Arg He Asp Lys Arg Gly Lys Val Lys Gly Thr - 
65 70 75 80 

Gin Glu Met Lys Asn Asn Tyr Asn He. Met Glu lie Arg Thr Val Ala 
85 90 95 

Val Gly He Val Ala He Lys Gly Val Glu Ser Glu Phe Tyr Leu Ala 
100 105 110 

Met Asn Lys Glu Gly Lys Leu Tyr Ala Lys Lys Glu Cys Asn Glu Asp 
115 120 125 

Cys Asn Phe Lys Glu Leu He Leu Glu Asn His Tyr Asn Thr Tyr Ala 
130 135 140 

. Ser Ala Lys Trp Thr His Asn Gly Gly Glu Met Phe Val Ala Leu Asn 
145 150. 155 160 

Gin Lys- Gly He Pro Val Arg Gly Lys Lys Thr Lys Lys Glu Gin Lys 
16J. 170 175 

Thr Ala His Phe Leu Pro Met Ala He Thr 
180 185 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LB3GTH: 499 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:5: 

TATGTGCAAT GACATGACTC CAGAGCAAAT GGCTACAAAT GTGAACTGTT CCAGCCCTGA 60 

GCGACACACA AGAACTTATG ATTACATGGA AGGAGGGGAT ATAAGAGTGA GAAGACTCTT 120 

CT G TC G AACA CAGTGGTACC TGAGGATCGA TAAAAGAGGC AAAGTAAAAG GGACCCAAGA 180 
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GATGAAGAAT AATTACAATA TCATGGAAAT CAGGACAGTG GCAGTTGGAA TTGTGGCAAT 240 

CAAAGGGGTG GAAAGTGAAT TCTA TCTTG C AATGAACAAG GAAGGAAAAC TCTATGCAAA 300 

GAAAGAATGC AATGAAGATT GTAACTTCAA AGAACTAATT CTGGAAAACC ATTACAACAC . 360 

ATATGCATCA GCTAAATGGA CACACAACGG AGGGGAAATG TTTGTTGCCT TAAATCAAAA 420 

GGGGATTCCT GTAAGAGGAA AAAAAACGAA GAAAGAACAA AAAACAGCCC ACTTTCTTCC 480 

TATGGCAATA ACTTAATAG 499 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 164 amino acids 
(BJ TYPE: amino acid . 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUQTCE DESCRIPTION: SEQ ID NO:6: 

Met Cys Asn Asp Met Thr Pro Glu Gin Met Ala Thr Asn Val Asn Cys 

1 ,5 , 10 15 

Ser Ser Pro Glu Arg His Thr Arg Ser Tyr Asp Tyr Met Glu Gly Gly 
20 25 30 

Asp He Arg Val Arg Arg Leu Pne Cys Arg Thr Gin Trp Tyr Leu Arg 
35 40 45 

He Asp Lys Arg Gly Lys Val Lys Gly Thr Gin Glu Met Lys Asn Asn 
50 - 55 60 

Tyr Asn He Met Glu lie Arg Thr Val Ala Val Gly He Val Ala He 
65 70 . 75 80 

Lys Gly Val Glu Ser Glu Phe Tyr Leu Ala Met Asn Lys Glu Gly Lys 
85 90 ' 35 

Leu Tirr Ala Lys Lys Glu Cys Asn Glu Asp Cys Asn Phe Lys Glu Leu 
100 105 HO 

He Leu Glu Asn His Tyr Asn Thr Tyr Ala Ser Ala Lys Trp Thr His 
115 120 125 

Asn Gly Gly Glu MetsShe Val Ala Leu Asn Gin Lys Gly He Pro Val 
130 135 I 40 

Arg Gly Lys Lys Thr Lys Lys Glu Gin Lys Thr Ala His Phe Leu Pro 
145 150 155 I 60 

Met Ala He Thr 
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(2) INFORMATION FOR. SEQ ID NO:7: 



S> • 

(i) S EQU EN CE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 



Ui> 



MOLECULE TYPE: cDNA 



txi) 



SEQUENCE DESCRIPTION: SEQ ID NO: 7: 



CAATGACCTA GGAGTAACAA TCAAC 



25 



(2) INFORMATION FOR SEQ ID NO:8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
AAAACAAACA TAAATGCACA AGTCCA 26 
12) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown ■ » 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE* DESCRIPTION: SEQ ID NO: 9: 
ACAACGCGTG CAATGACATG ACTCCA 
(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 



(ii) MOLECULE TYPE: cDNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
ACAGGATCCT ATTAAGTTAT TGCCATAGGA A . 
(2) INFORMATION FOR SEQ ID NO:ll: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : unknown 
„ (D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

ACACATATGT GCAATGACAT GACTCCA 

(2) INFORMATION FOR SEQ ID NO: 12 : 

• (i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 37 base pairs 
(5) TYPE: nucleic acid 
(CJ STRANDEDNESS: unknown 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
CTGCGTATCG ACAAACGCGG CAAAGTCAAG GGCACCC 
(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: ' 

(A) LENGTH: 44 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
AAGAGATGAA AAACAACTAC AATATTATGG AAATCCGTAC TGTT 
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(2 ) INFORMATION FOR SEQ ID NO : 14 : 

(i) SEQ UENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 
(CJ STRANDEDNESS : unknown 
(D) TOPOLOGY: unknown 

(ii) MOLECULE. TYPE: cDNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
GCTGTTGGTA TCGTTGCAAT CAAAGGTGTT GAATCTG 
(2) INFORMATION FOR SEQ ID NO: IS: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 base pairs 

(B) TYPE: nucleic acid . 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
TCTTGGGTGC CCTTGACTTT GCCGCGTTTG TCGATACGCA GGTAC 
(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4S*.base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown , 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
ACAGCAACAG TACGGATTTC CATAATATTG TAGTTGTTTT TCATC 
(2) INFORMATION FOR SEQ ID NO:17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 
•(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 
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(xi) SEQUENCE DESCRIPTION: SEQIDNO:17: 
AATTCAGATT CAACACCTTT GATTGCAACG ATACCA ' 35 
(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : unknown ' 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
AGTTTTGATC TAGAAGGAGG 20 
(2) INFORMATION FOR SEQ ID.NO:19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

TCAAAACTGG ATCCTATTAA 20 

(2) INFORMATION FOR SEQ ID NO: 20: 

(X) SEQUENC E CH ARACTERISTICS: ' 

(A) LENGTH: 91 base pairs 
, (B) TYPE: nucleic acid * 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 
AGTTTTGATC TAGAAGGAGG AATAACATAT GTGCAACGAC ATGACTCCGG AACAGATGGC 60 
TACCAACGT? AACTGCTCCA GCCCGGAACG T 91 
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(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SE QUENCE CHARACTERISTICS: 

(A) LENGTH: 90 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 
<D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA . 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

CACACCCGTA GCTACGACTA CATGGAAGGT GGTGACATCC GTGTTCGTCG TCTGTTCTGC 60 

CGTACCCAGT GGTACCTGCG TATCGACAAA 90 

(2) INFORMATION FOR SEQ ID NO:22: 

■ •» ' 
* (i)' SEQUQJCE CHARACTERISTICS: 

(A) LENGTH: 90 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA ■■, 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:22: 
CGTGGTAAAG TTAAAGGTAC CCAGGAAATG AAAAACAACT ACAACATCAT GGAAATCCGT* 60 
ACTGTTGCTG TTGGTATCGT TGCAATCAAA . 90 
(2) INFORMATION FOR SEQ ID NO:23: 

(i) SE QU E NCE CHARACTERISTICS : 

(A) LENGTH: 90 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 



(xi) SE QU E NCE DESCRIPTION: SEQ ID NO: 23:^ 
GGTGTTGAAT CTGAATTCTA CCTGGCAATG* AACAAAGAAG GTAAACTGTA CGCAAAAAAA 
GAATGCAACG AAGACTGCAA CTTCAAAGAA 



60 
90 
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(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 90 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:24: 
CTGATCCTGG . AAAACCACTA CAACACCTAC GCATCTGCTA AATGGACCCA CAACGGTGGT 60 
GAAATGTTCG TTGCTCTGAA CCAGAAAGGT 90 
(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 88 base pairs 

(B) TYPE: nucleic acid ; 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: CDNA 

(xi) SEQUQICE DESCRIPTION: SEQ ID NO:2S: 
ATCCCGGTTC GTGGTAAAAA AACCAAAAAA GAACAGAAAA CCGCTCACTT CCTGCCGATG . 60 

GCAATCACTT AATAGGATCC AGTTTTGA . 88 

(2) INFORMATION FOR SEQ ID NO:26: 

(i) SE QUENCE CHARACTERISTICS: 

(A) LQK3TH: 20 base pairs^ 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

. 20 

TACGGGTGTG ACGTTCCGGG 

(2) INFORMATION FOR SEQ » NO:27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDH2NESS : unJmown 

(D) TOPOLOGY: unknown 
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(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27 
CTTTACCACG TTTGTCGATA 
(2) INFORMATION FOR SEQ ID NO: 28: 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: unknown 

(ii) 'MOLECULE TYPE: cONA 



(xi) SEQUE NCE DESCRIPTION: SEQ ID NO: 28 
ATTCAACACC TTTGATTGCA 
(2) INFORMATION FOR SEQ ID NO:29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: . 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unJcnown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29 
CCAGGATCAG TTCTTTGAAG 
(2) INFORMATION FOR SEQ ID NO:30: 

(i) SEQUfc B CK CHARACTSaSTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30 
GAACCGGGAT ACCTTTCTGG 
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(2) INFORMATION FOR SEQ ID NO:31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 495 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:31: 

ATGTCTAATG ATATGACTCC GGAACAGATG GCTACCAACG TTAACTCCTC CTCCCCGGAA 60 

CGTCACACGC GTTCCTACGA CTACATGGAA GGTGGTGACA TCCGCGTACG TCGTCTGTTC 120 

TGCCGTACCC AGTGGTACCT GCGTATCGAC AAACGCGGCA AAGTCAAGGG CACCCAAGAG 180 

ATGAAAAACA ACTACAATAT TATGGAAATC CGTACTGTTG CTGTTGGTAT CGTTGCAATC 240 

AAAGGTGTTG* AATCTGAATT *CTACCTGGCA ATGAACAAAG AAGGTAAACT GTACGCAAAA 300 

AAAGAATGCA ACGAAGACTG CAACTTCAAA GAACTGATCC TGGAAAACCA CTACAACACC 360 

TACGCATCTG CTAAATGGAC CCACAACGGT GGTGAAATGT TCGTTCCTCT GAACCAGAAA 420 

GGTATCCCGG TTCGTGGTAA AAAAACCAAA AAAGAACAGA AAACCGCTCA CTTCCTGCCG' 480 

495 

ATGGCAATCA CTTAA 

(2) INFORMATION. FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 164 amino acids 

(B) TYPE: amino acid 

<C) STRANDOSNESS: unknown 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:32: 

Met Ser Asn Asp Met Thr Pro Glu. Gin Met Ala Thr Asn Val Asn Ser 
1 S IP 15 

Ser Ser Pro Glu Arg His Thr Arg Ser Tyr Asp Tyr Met Glu Gly Gly 
20' 25 30 

Asp lie Arg Val Arg Arg Leu Phe Cys Arg Thr Gin Trp Tyr Leu Arg 
35 40 45 

He Asp Lys Arg Gly Lys Val Lys Gly Thr Gin Glu Met Lys Asn Asn 
50 55 60 
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Tyr Asa He Met Glu He Arg Thr Val Ala Val Gly He Val Ala He 
65 70 75 80 

Lys Gly Val Glu Ser Glu Phe Tyr Leu Ala Met Asn Lys Glu Gly Lys 
85 90 95 

Leu Tyr Ala Lys Lys Glu Cys Asn Glu Asp Cys Asn Phe Lys Glu Leu 
100 105 110 

He Leu Glu Asn His Tyr Asn Thr Tyr Ala Ser Ala Lys Trp Thr His 
115 120 125 

Asn Gly Gly Glu- Met Phe Val Ala Leu Asn Gin Lys Gly Zle Pro Val 
.130 135 140 

Arg Gly Lys Lys Thr Lys Lys Glu Gin Lys Thr Ala His Phe Leu Pro 
145 150 V 155 ■ 160 

Met Ala Zle Thr 



(2) INFORMATION FOR SEQ ZD NO:33: 

(i) SE QU E N CE CHARACTERISTICS : 

(A) LENGTH: 495 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 

ATGTGCAATG ATATGACTCC TGAACAAATG GCTACCAATG TCAACTGTTC CTCTCCGGAG 60 

CGCCACACCC GGAGTTACGA TTACATGGAA GGTGGGGATA TTCGCGTACG T CGTC T G TTC 120 

TGCCGTACCC AGTGGTACCT GCGTATCGAC AAACGCGGCA AAGTCAAGGG CACCCAAGAG 180 

ATGAAAAACA ACTACAATAT TATGGAAATC CGTA CTGTT G CT G TT G GTAT CGTTGCAATC 240 

AAAGGTGTTG AATCTGAATT CTATCTTGCA ATGAACAAGG AAGGAAAACT CTATGCAAAG 300 

AAAGAATGCA ATGAAGATTG TAACTTCAAA GAACTAATTC TGGAAAACCA TTACAACACA 360 

TATGCATCTG CTAAATGGAC CCACAACGGT GGTGAAATGT TCGTTGCTCT GAACCAGAAA 420 

GGTATCCCTG TTCAAGGTAA GAAAACCAACi. £AAGAACAGA AAACCGCTCA CTTCCTGCCG 480 

ATGGCAATCA CTTAA 495 
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(2) INFORMATION FOR SEQ ID NO:34: 

(i) SEQXJENCE CHARACTERISTICS: 

(A) LENGTH: 164 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 



* (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 

Met Cys Asn Asp Met Thr Pro Glu Gin Met Ala Tar Asn Val Asn Cys 
1 5 10 15 

Ser Ser Pro Glu Arg His Thr Arg Ser Tyr Asp Tyr Met Glu Gly Gly 
20 25 30 

Asp He Arg Val Arg Arg Leu Phe Cys Arg Thr Gin Trp Tyr Leu Arg 
35 40 . .45 

He Asp Lys Arg Gly Lys Val Lys Gly Thr Gin Glu Met Lys Asn Asn 
50 55 SO 

Tyr Asn He Met Glu He Arg Thr Val Ala Val Gly He Val Ala lie 

' 65 70 . 75 80 

Lys Gly Val Glu Ser Glu Phe Tyr Leu Ala Met Asn Lys Glu Gly Lys 
85 90 « 

Leu Tyr Ala Lys Lys Glu Cys Asn Glu Asp Cys Asn Phe Lys Glu Leu. 
100 1° 5 110 • 

He Leu GluC Asn His Tyr Asn Thr Tyr Ala Ser Ala Lys Trp Thr His 
US 120 125 

Asn Gly Gly Glu Met Phe Val Ala Leu Asn Gin Lys Gly He Pro Val 
130 135 140 

Gin Gly Lys Lys Thr Lys Lys Glu Gin Lys Thr -Ala His Phe Leu Pro 
145 ISO 155 

Met Ala He Thr 

(2) INFORMATION FOR SEQ ID NO:35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LQJGTH: 495 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 



(ii) MOLECULE TYPE: CDNA 
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(xi) SEQUENCE DESCRIPTION: SEQ 10 NO: 35: 
ATGTCTAATG ATATGACTCC GGAACAGATG GCTACCAACG TTAACTCCTC CTCCCCGGAA 60 

CGTCACACGC GTTCCTACGA CTACATGGAA GGTGGTGACA TCCGCGTACG TCGTCTGTTC 120 

TGCCGTACCC AGTGGTACCT GCGTATCGAC AAACGCGGCA AAGTCAAGGG CACCCAAGAG 180 

ATGAAAAACA ACTACAATAT TATGGAAATC CGTACTGTTG CTGTTGGTAT CGTTGCAATC 240 

AAAGGTGTTG AATCTGAATT CTATCTTGCA ATGAACAAGG AAGGAAAACT CTATGCAAAG 300 

AAAGAATGCA ATGAAGATTG TAACTTCAAA GAACTAATTC TGGAAAACCA TTACAACACA 3 60 

TATGCATCTG CTAAATGGAC CCACAACGGT GGTGAAATGT TCGTTGCTCT GAACCAGAAA 420 

GGTATCCCTG TTCAAGGTAA GAAAACCAAG AAAGAACAGA AAACCGCTCA CTTCCTGCCG 480 

ATGGCAATCA CTTAA 495 
(2) INFORMATION- FOR SEQ ID NO: 36: 

Ci) SEQUENCE CHARACTERISTICS: 

(A) LEMGTH: 164 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ IDNO:36: 

Met Ser Asn Asp Met Thr Pro Glu Gin Met Ala Thr Asn Val Asn Ser 
1 5 : 10 15 

Ser Ser Pro Glu Arg His Thr Arg Ser Tyr Asp Tyr Met Glu Gly Gly 
20 25 30 

Asp He Arg Val Arg Arg Leu Phe Cys Arg Thr Gin Trp Tyr Leu Arg 
35 40 45-* 

He Asp Lys Arg Gly Lys Val Lys Gly Thr Gin Glu Met Lys Asn Asn 
50 55 60 

Tyr Asn He Met Glu He Arg Thr Val Ala Val Gly He Val Ala He 
65 70 • 75 80 

Lys Gly Val Glu Ser Glu Phe. Tyr Leu Ala Met Asn Lys Glu Gly Lys 
85 90 95 

Leu Tyr Ala Lys Lys Glu Cys Asn Glu Asp Cys Asn Phe Lys Glu Leu 
100 105 HO . 

He Leu Glu Asn His Tyr Asn Thr Tyr Ala Ser Ala Lys Trp Thr His 
115 120 125 
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Asn Gly Gly Glu Met Phe Val Ala Leu Asn Gin Lys Gly He Pro Val 
130 135 140 

Gin Gly Lys Lys Thr Lys Lys Glu Gin Lys Thr Ala His Phe Leu Pro 
14 5 150 155 160 

Met Ala lie Thr 

(2) INFORMATION FOR SEQ ID NO:37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 450 base pairs 

(B) - TYPE: nucleic acid 

" (C) STRANDEDNESS: unknown 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:37: ; 
ATGTCTTCTC CTGAACGTCA TACGCGTTCC TACGACTACA TGGAAGGTGG TGACATCCGC 60. 

GTACGTCGTC TGTTCTGCCG TACCCAGTGG TACCTGCGTA TCGACAAACG CGGCAAAGTC 120 

AAGGGCACCC AAGAGATGAA AAACAACTAC AATATTATGG AAATCCGTAC TGTTGCTGTT 180 

GGTATCGTTG CAATCAAAGG TGTTGAATCT GAATTCTACC TGGCAATGAA CAAAGAAGGT 240 

AAACTGTACG CAAAAAAAGA ATGCAACGAA GACTGCAACT TCAAAGAACT GATCCTGGAA 300 

AACCACTACA ACACCTACGC ATCTGCTAAA TGGACCCACA ACGGTGGTGA AATGTTCGTT 3 60 

GCTCTGAACC AGAAAGGTAT CCCGGTTCGT GGTAAAAAAA CCAAAAAAGA ACAGAAAACC 420 

GCTCACTTCC TGCCGATGGC AATCACTTAA 450 
(2)* INFORMATION FOR SEQ ID NO:38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 149 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 

Met Ser Ser Pro Glu Arg His Thr Arg Ser Tyr Asp Tyr Met Glu Gly 
1 5 10 15 

Gly Asp He Arg Val Arg Arg Leu Phe Cys Arg Thr Gin Trp Tyr Leu 
20 25 30 
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Arg lie Asp Lys Arg Gly Lys Val Lys Gly Thr Gin Glu Met Lys Asn 
^35 40 45 

Asn Tyr Asn lie Met Glu He Arg Thr Val Ala Val Gly lie Val Ala 
50 55 60 

He Lys Gly Val Glu Sex Glu Phe Tyr Leu Ala Met Asn Lys Glu Gly 
65 70 75 80 

Lys Leu Tyr Ala Lys Lys Glu Cys Asn Glu Asp Cys Asn Phe Lys Glu 
85 90 95 

Leu He Leu Glu Asn His Tyr Asn Thr Tyr Ala Ser Ala Lys Trp Thr 
100 105 no 

His Asn Gly Gly Glu Met Phe Val Ala Leu Asn Gin Lys Gly He Pro 
115 120 125 

Val Arg Gly Lys Lys Thr Lys Lys Glu Gin Lys Thr Ala His Phe Leu 
130 135 140 

Pro Met Ala He Thr 
145 ' 

(2) INFORMATION FOR SEQ ID NO:39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 426- base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 

ATGTCCTACG ACTACATGGA AGGTGGTGAC ATCCGCGTAC GTCGTCTGTT CTGCCGTACC 60 

CAGTGGTACC TGCGTATCGA CAAACGCGGC AAAGTCAAGG GCACCCAAGA GATGAAAAAC 120 

AACTACAATA TTATGGAAAT CCGTACTGTT GCTGTTGGTA TCGTTGCAAT CAAAGGTGTT 180 

GAATCTGAAT TCTACCTCJC AATGAACAAA GAAGGTAAAC TGTACGCAAA AAAAGAATGC 240 

AACGAAGACT GCAACTTCAA AGAACTGATC CTGGAAAACC ACTACAACAC CTACGCATCT 300 

GCTAAATGGA CCCACAACGG TGGTGAAATG TTCGTTGCTC TGAACCAGAA AGGTATCCCG 360 

GTTCGTGGTA AAAAAACCAA AAAAGAACAG AAAACCGCTC ACTTCCTGCC GATGGCAATC 420 
ACTTAA 



426 
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(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUHJCE CHARACTERISTICS: 

(A) LENGTH: X4X amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 

Met Ser Tyr Asp Tyr Met Glu Gly Gly Asp He Arg Val Arg Arg Leu 

1 ' • S 10 15 

Phe Cys Arg Thr Gin Trp Tyr Leu Arg He Asp Lys Arg Gly Lys Val 
20 25 t 30 

Lys Gly Thr Gin Glu Met Lys Asn Asn Tyr Asn He Met Glu He Arg 
35 40 45 

Thr Val Ala Val Gly He Val Ala He Lys Gly Val Glu Ser Glu Phe 
50 55 60 

Tyr Leu Ala Met Asn Lys Glu Gly Lys Leu Tyr Ala Lys Lys Glu Cys 
65 70 75 80 

Asn Glu Asp Cys Asn Phe Lys Glu Leu He Leu Glu Asn His Tyr Asn 
85 90 95 

Thr Tyr Ala Ser Ala Lys Trp Thr His Asn Gly Gly Glu Met Phe Val 
100 105. HO 

Ala Leu Asn ^31n Lys Gly He Pro Val Arg Gly Lys Lys Thr Lys Lys 
115 • 120 125 

Glu Gin Lys Thr Ala His Phe Len Pro Met Ala He Thr 
130 135 140 



(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 426 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 



(ii) 



MOLECULE TYPE: ddlA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 

ATGTCCTACG ACTACATGGA AGGTGGTGAC ATCCGCGTAC GTCGTCTGTT CTGCCGTACC 60, 
CAGTGGTACC TGCGTATCGA CAAACGCGGC AAAGTCAAGG GCACCCAAGA GATGAAAAAC ; 120 

AACTACAATA' TTATGGAAAT CCGTACTGTT GCTGTTGGTA TCGTTGCAAT CAAAGGTGTT 180 

GAATCTGAAT TCTATCTT G C AATGAACAAG GAAGGAAAAC TCTATGCAAA GAAAGAATGC 240. 
AATGAAGATT GTAACTTCAA AGAACTAATT CTGGAAAACC ATTACAACAC ATATGCATCT 300 

GCTAAATGGA CCCACAACGG TGGTGAAATG TTCGTTGCTC TGAACCAGAA AGGTATCCCT- 360 

GTTCAAGGTA AGAAAACCAA GAAAGAACAG AAAACCGCTC ACTTCCTGCC GATGGCAATC 420 

ACTTAA 426 - 
(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LBJGTH: 141 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:42: 

Met Ser Tyr Asp Tyr Met Glu Gly Gly Asp lie Arg Val Arg Arg Leu 
1 5 10 15 

Phe Cys Arg Thr Gin Trp Tyr Leu Arg He Asp Lys Arg Gly Lys Val 
20 *• 25 30 

Lys Gly Thr Gin Glu Met Lys Asn Asn Tyr Asn He Met Glu He Arg 
35 40 ' 45 

Thr Val Ala Val Gly He Val Ala He Lys Gly Val Glu Ser Glu Phe 
SO 55 60 * 

Tyr Leu Ala Met Asn Lys Glu Gly Lys Leu Tyr Ala Lys Lys Glu Cys 
65 70 75 80 

Asn Glu Asp Cys Asn Phe Lys Glu Leu He Leu Glu Asn His Tyr Asn 
85 90 95 

Thr Tyr Ala Ser Ala Lys Txa Thr His Asn Gly Gly Glu Met Phe Val 
100 105 HO 

Ala Leu Asn Gin Lys Gly He Pro Val Gin Gly Lys Lys Thr Lys Lys 
115 120 125 

Glu Gin Lys Thr Ala His Phe Leu Pro Met Ala He Thr 
130 135 140 
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(2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

. (ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 
GAGCTCACTA GTGTCGACCT GCAG • 
(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUQICE CHARACTERISTICS : 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME /KEY: - 

(B) LOCATION: complement (1..24) 



(xi > SEQUENCE DESCRIPTION: SEQ ID NO : 44 : 
CTGCAGGTCG ACACTAGTGA GCTC 
(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: , 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: CDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 
CAATCTACAA TTCACAGA 
(2) INFORMATION FOR SEQ ID NO:46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46 

TTAAGTTATT GCCATAGG 

(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS: 
. (A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47 
AACAAAGCTT CTACAATTCA CAGATAGGA 
(2) INFORMATION FOR SEQ ID NO:48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48 
AACAAGATCT TAAGTTATTG CCATAGG 
(2) INFORMATION FOR SEQ ID NO: 49: 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LOJGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: unknown 

** * 

(ii) MOLECULE TYPE: C0NA 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 49 
CGGTCTAGAC CACCATGCAC AAATGGATAC TGACATGG 
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(2) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 
GCCGTCGACC TATTAAGTTA TTGCCATAGG AAG 
(2) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LQJGTH: 16 amino acids 

(B) TYPE: amino acid 

(C) STRANDH3NESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 51: 




(2) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUQTCE CHARACTERISTICS: 

(A) LOJGTH: 12 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 

Ser Tyr Asp Tyr Met Glu Gly Gly Asp He Arg 

1 . 5 _ 10 
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WHAT IS CLAIMED IS: 

1 . A method for purifying a keratinocyte growth 
factor (KGF) , the method comprising: 

5 a) obtaining a solution comprising KGF; 

b) binding KGF from the solution of part (a) to a 
cation exchange resin; 

c) eluting KGF in an eluate solution from the 
. .cation exchange resin; 

10 d) .' passing the eluate solution from part (c) 

through an appropriate molecular weight 
exclusion matrix; and 
e) recovering KGF from the molecular weight 
exclusion matrix. 

15 

2. The method according to Claim 1 wherein the 
KGF is produced in procaryotic cells. 

3. The method according to Claim 1 wherein the 
20 KGF is produced in E. coli. 

4. The method according to Claim 1 wherein the KGF 
is produced in" mammalian cells. 

25 5. The method according* to Claim 4 wherein the 

KGF is produced in Chinese hamster ovary cells. 

6. A method for purifying a keratinocyte growth 
factor (KGF), the method comprising: 
30 a) obtaining a solution, comprising KGF; 

b) binding KGF from the solution of part (a) to a 
cation exchange resin; 

c) eluting KGF in an eluate solution from the 
cation exchange resin; 



i 
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10 



15 



d) 



e) 



performing hydrophobic interaction 
chromatography on the eluate solution of part 
(c) ; and 

recovering KGF from' the hydrophobic 
interaction chromatography step of part (d) . 



7. 



A method according to Claim 6 further 
comprising oxidation of free sulfhydryl groups in KGF. 

' a. The method according to Claim 6 wherein the 
KGF is produced in procaryotic cells. 

9. The method according to Claim 7 wherein the 
KGF is; produced in E. coli. 

10. The method according to Claim 6 wherein the 
KGF is produced in mammalian cells. 

11 The method according to Claim 10 wherein the 
20 KGF is produced in Chinese hamster ovary cells . 
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Figure 1 - ..■ 
huaan KGF (+ signal sequence) 

I — -OLZGO#25 1 I OLIGO#1 ■ j 

5 ' CAAXC7 ACAATTCACAGA 3* 5 1 CAATGACCTAGGAGTAACAATCAAC 3' 

5 1 CAATCTACAATTCACAGATAGGAAGAGGTCAATGACCTAGGAGTAACAATCAACTCAA 

-h i h »~ 1 i ■ 60 

-TTCATTrrCATTATGTrATTCAXGAACACCCGGAGCACTACACT 

— 1 — : > » — + ■ 120 

m a X H t 

-TACTGACAtGCATCCTGCCAACTTTGCTCTACAGATCATGCTTTCACATTAT^ 

> »■ ■ ■ . > ■ » + 180 

tTWXLPTLIiYR. SCFHZZCL V 

-TGGGTACTAT A1 CI I IA G C I TVjCA ATGACATGACTCCAGAGCAAATGGCTACAAATGTGA- 

► » ■ ■ ■ ■ ■■ H ■ ■ > ► 240 

G T ' I • S X* A C H DMT P E Q M A T N V M 

* A CTG r i C CAGCCCTGAGCGACACACAAGAAGTr ATGATT AjCATGGAAGGAGGGGAT ATAA^ 

> » »■ » ■■ ► 300 

CSSPERHTR STDTHEGGD IR 

-GAGTGAGAAGACTCT rCTGTCGAACACAGTGGTACCTGAGGATCGAgAAAAGAGGCAAAS" 

> ■ 1 ■■,■■»: • ■■■■ i — ► 360 




7AAT7ACAATATCATGGAAATCAGGACAGTGGCAG- 

» » ' '» » 420 



XGTQ EMX MNYNXKS Z R T V A V 

-TTGGAATTGTGGCAATCAAAC GGC TGGAAAgTCAATTCTATCTTGCA^ 

i 1 , ■ . ' ■ » ■ ; 1 . , » ■ ).. .■■ » 48O 

GZVAZRGVESE PTL A MNKEG 
-GAAAACTC7ATGCAAAGAAAGAA?GCAATGAAGATTGTAAC77CAAAGAAC^ 



540 




NHYHTTASAXW THHGGEMFV 
-' i! IQCCT TA AA T CAAAA G QGG A T T CC IGX AAGAGGAAAAA A AA CG AACAAAGA A CAAAAAA- 

A L M Q X G Z P V R G X X T X X E Q X T 
-C A fiC C CX L TTTCT ICCi ATGGCAATAACTTAATTGCATATGGTAXATAAAGAA CCCAG TT 



660 



720 



AHFLPKAIT* 

3 * GGATACCGTT ATTGAATT V 
I— OLIGOI26 1 
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(continued) 

HXAGCAGGGAG AI I rCTTTA AGTGGACTCT T I rCTTTCT r CTCAAA AT i ; id 1 1LL1 U 
f » » " *- ♦ 780 

- TATTTTTTACTAATCXAGAAAGGCTGGAAAAACTACXGAAAAACTGATCAAGCTGGAC^ 
> ■ ■ > ■ ■ » ■ «. 

3 'ACCTGAA- 
I— — 

-CTGCA3TTATGTTTGTTTTAAG 3 f 

1 . »■ . 862 

-CACCTAAATACAAACAAAA V 
OLXGO»2 I 
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' f ignr# 3 

RSH-KG? 

plaamid DNA CUI Xb«X j***; 

3«qu«JC» 5 • •ATCGATTTGATTCTAGAAGGAGGAATAACATATGAAAAAG- 

M K K 

R3H signal a«qu«nc« HluX 
-CGCSCACGTGCTATCGCCArrGCTGTGGCTCTGGCAGGTTTCGCAACTAGTGCAC^^ • 
R A R A I A t A V A L A G F A T S A H A - 

MZ»X 

5 • CGCSrTGCAATGACATGACTCCAGAGCAAATGGCTACAAATCT 

— » — > 1 ■ " l r*T * 60 

- CKDMT PEQ MATK V KCSS PE 

-GCGACACACAAGAAGTTATGATTACATGGAAGGAGGGGATAT 

— — t h ■ ! ■ + 12 Q 

RKTRS T D TKEG G O Z R V R R L T 

jqpwx ci*x 

-CTCTCtlAACACAOTGGTACCTGAGGATCC^ 

1 1 . 1 i — »~ ► 180 

C R T Q « T L R Z 0 X R G X V JC G T Q.E 

*GATGAAGAA7AATTACAATA7CATGGAAATCAGGACAGTGGCAGn 

: ■ 1 I 1- ► - ■ H 240 

M X K K T N I M E I R T V A V G I V A I 

reoRX 

•"CAA A GGGGTGGAAAGTGAATTCTATCTTGCAATGAACAAG G AAGGAAAACTCT 
> ■ ■ ' »■■■■ > + ► 3flo 

RGVESErTI.AMKR EGR Z.rAX 



-^AAAGAATGCAA?GAAGArrCTAACr?CXAArUAC7A 

■ ■ 1 > »■ ' » ■ ■■ » 360 

K B C H E D CM F R E L Z & S K H TNT 

JfctoX 

- A7ATGC U C A GC TAAAT G GACACACAACGGAGG G GAA ATG TTTGT rG CCTTAAATCAAAA- 

- > » ■ » ■ > ► ■ l t> 420 

XA 3AKWTH KGGEMFV ALKQ X 

HS GSGA TT CCTGXA MafiGAAAAAAAA CS AAGAAAGA ICITC C- 

l ■ ' ■ » — ► 480 

G Z PVRGXXTKKSQKTAHFXi? 



-TAIGGCAAXAACTTAATAG 3* -plumid OKA 

> > 503 -3*qu*nc* 

M A Z T • * 
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KG? 

HdmX 

MC M DMTPCQMA TNVN C S S P £ " 

^SACACACAAGAAgTTATGArrAaiT( X UACGAGGGG^ . 

R H T R S. T O Y M E • G G D I R V R R L T 12 ° 

ilfr flX ClaX 

•^GT CG AaCACA G TGGTACCTGAGGATCGAT A AJUtfa ra reAAACT 

C R T Q * t t R I 0 X R a X V X G T Q V ^ 

H^TGAAGMTAARACAATATCATGGAAATCAGG*^^ 

— — -» 1 > i »..„ + 240 

M K M N T H I M E X R T V A V G I V A I 

ecorx 

-caaaggc»3tggaaagtgaatrctatcttgcaatgaacaaggaaggaaaac?ct 

»- 1 » + 300 

X G V E S S F T L A M M X S G X X T A K 



HSAAACAATCCAATGAAGATTGTAACTTCAAAGAJUrtA^ 

-s -H ' 1— — » ► 360 

X E C N E 0 C » F K E L Z L E H H T M T 

tfd*X 

~ AJ ATGCATCAXSCT AAATgGACACACAACGGAGGGGAAA TG I T T O I TG CCTTAAATCAAAA- 

— — * I - : * ■ " ' ■ 420 

T A 5 A K W T H N G G E M f V A L H Q X 

"^gSS»TCCTGTA A C A GG^ A AAAAA ACG AA G A 

1 »— I » i > 480 

G 1.1,7 R G S X T X X E Q X T A H T L 9 



-TATGGCAATAACTTAASXG 3* 

— »■ »— 503 

M A Z T • 



i 



WOW113S2 

v 8/15 
Tigwe 5 



substitution ot Tpnl to EcoRI sequence to make KGF(dsd) 
XpnX 

\ ■ ■ OLIGOf 6 — I I --■ -—01,1(50 #7—— 

5 9 CTGCGT ATCGACAAAC GCGGCAAAGTCAAGGGCACCCAAGAGATGAAAA^ - 
3 * CATGGACGCATXGCTGTTTGCGCCGTTTCA GT T CCCGTG GGTTCTCTA CT 1 1 1 rGTIG AXGTTA- 

| 1 0IIG0I9 1 I— Ot»IGO#lO 

-LRIOKRGK VKGTQ EMKHNTN - 

zcoitr 

It — - 0LXG0I 8 — ■ 1 

5 * -AXTAXGGAAAXCCSTACTGT TGC rGTTGGTATCGTTGCAATCAAAGGTGTTGAATCTG 3 ' 
3 f -TAAIACCTrTJU3GCAIGACAACGACAACC^ 5' 

■ - 1 I — — 0LXG0#11 1 

I M C I R t V A V G I V A I K G V E 3 E - 



PCT/US9S/13099 



WOW11952 FCT/US95O3099 

' " . . - 9/15 

KGF (codon optimized) 

|— -OtXGO#12 1 

5 1 AGTTTTGATCT AGAAGGAGG 3' 

| ; OLIGO»14 

5'AGTTTTGATCTAGAAGGAGGAATAACATAXGTGCAACGACAT^^ 



*l I : 0LIGOU5- 



-ACCAACGTTAACTGCTCCAGCCOaUACGTCACACCC^ 

3* CGS CCTTC OSTC Tg gCCAT 3 ' 
. I — — OLXG0#2P 1 

; — otiGO#ia ~ — — r i- 

-GTGACATC CG X C TT CGTCG I C X G r i' C I G CCGTACCCAGTGGTACCTGCGTAgCGACAAA C G- 

3 - ATAGCTGTTTGC- 
|-OLXG0#2I~ 



-CLXGOUS- 



-TGGTAAAGTTAAAGGTACCCAGGAAATGAAAAACAACTACAACATCATGGAAATCCGTACT- 
-ACCATTTC S f 
1 



3* ACGTTAGX I ICCA CAACTTA 3* 
I — 01IGO#22— I 



-AAGAAGGTJ 

3 •GAAGTTTCTTGACTA- 
I— -OLIGO#23 



-0LXG0M8- 



-CCTGGAAA A CCACTACAACACCTACGC A T C T GCIA AAT 
-GGACC 5* 



-0LIG0#19- 



rGGTAAAAAAACCAAAAAAflAACAGA- 
3' QBXCtt tCCATAGGGCCAAfl 3 f 
|— OUGOI24 1 

■QLXWX 9 ■ : - 1 



-AAACeOCTCACTTCCTGCCSATGGCAATCAC?TAATAGGATCCACTTTTGA 3' 

3' AATTATCCTJU3GTCAAAACT 3* 
I — 01XG0#13— I 
Buffi 



PCT7US95O3099 
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Figure 7 
KGF C(1,15)S 

5 1 ATGTCTAATGATATGACTCCGGAACAGATGGCTACCAACGTTJ 

M5N0MTP S QMATHVNS S S 9 Z 

H35TauaU35CSTTCCTACra 

■ ■> » ■>■■ » : 120 

R H t RS TO TMEGGD Z RVRRL P 

-TCCCCTACCC*STGGTACCtgCSTATCg*C*AA C SCGG 

i » ■ I » ► 180 

CRTQWTLRIDKRGKVKGTQE 

^ATGAAA A ACJUl C TACAATAyTATGGAAAJ CCG T ACTGT T GCrGT T G CT ATCCT I G CAAtC- 

» ■ ■ ■ ' • 1 ■ + 240 

M K N N T N X H Z I R T V A V G X 7 A X 

-AAX GGTG XU ^a AAgCTGAAtTCTACCrGGCAATGAACAAAflAAGGTAAACTGTACGCAAAA* 



K G V Z 3 EFTLAM NKS G1CLTA K 
"AAAGAAJGCAACGAAGACTGCXACTTCAAAGAACTGATCCT^ 



300 




360 



-+ 420 




rAAAAAJUCCAAAAAJUSAACAGAAAACC GC T C X C IT CC X G CCG- 
1 i >■ h ♦ 480 



GXP VRGKKTKKEQKTA HrtP 



-ATGGCAATCACTTAA 3 • 

1— 495 

M A X T • 



WO 96/11952 



PCT/US95fl3099 
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Figure 8 
KGF R(144)Q 

5 9 ATGTGCAXXGAT ATGACTCCTGAACAAATGGCT A 



M C K 0 M T P S Q M A T H V N C S 3 P C 



60 



rACGATTACATGGAAGGTGGGGATATTCGCGTACGTCGTCTGTTC- 



R H T R S Y'DYMEGGD X R V R R L r 
«»TGCCGTACCCACTGGTA£CTGCGTATHi AT X X A CGC G GCAAAST CA lff rc TO k TrCA AGAfr 



120 



C R T Q W Y L R X 0 X R G X V X G T Q £ 
- ATGAAAAACAACTACAAXAXTATGGAAATCCGTA C T GI T GC rGT T GGTA XCGTTGCAATC- 
M X N M Y M X M S X R T V A V G X V A X 



180 



240 



-AAAGGTGTTGAATCTGAATTCTATCTTGCAATGAACAA^ 
X G V 8 S t F Y L A M M X E G X t T A X 



300 



-AAA6AATGCAAXGAAGArrt7TAACTTe*AAGAACTAATTCT 

» ■ ■ - ," ■ 1 > 1 » ► 360 

X E C M E DC M F X E L £ X* E H H Y M T 



420 



Y A S A X W T H M G G E M. F V A X» H Q X 

HjGTATCCCTGTICAAGGYAAGAAAA CC AACAAAGAACAGAAAACCgCTCACT^ 

G X P V Q G X X T X X E Q X T A H F L P 

-ATGGCAATCACTTAA 3* 

1 495 

M A X T * 



480 



WO 96/11952 PCT/US95/13099 

. W\S ~ 

Figure 9 
KG? CU,15)S/RU44)Q 

3 • ATCTCTAATGAgAtGACTCCGGAACASATGGCTACCAACS^ 

'." ' 1 1 * » ■■■ -+ 60 

NSHDMTPSQMATNVHSSSPt 

-CGTCACXC G C G TTCCTACGACTACAirGGAAGCT 
I i i h- k 120 




— 1 1 — r * ' ■» 180 

C R 7 Q If T L R X 0 X R G X VKG T Q £ 

-AgGAAAAACAACTACAATATTATGGAAATCCSTACTGl rGCTCTTG CTATCSTTGCAATC- 

>■ " 1 1 »■■■»■■ ■ ■ i ■■ » ;+ 240 

M X If .If T M X M S X R T V . A V G I V A I 

4 

~AAAflGTGTTGAA7CTGAATTCTATCTTGCAA 

: : > > 1 ► 300 

K G V E S E r T LAHN K S G KL YAK 

~AAAGAATGCAATGAAGATTGTAACTTCAAACAACTAAgTCrGGA A AA CC AgT A CAACACA- 
1- — ^ — ' » : > i -+ 360 



420 




T A 3 A X W 7 H M G G S M T V A t H Q R 

*GGXAgCCCTGTTCAAGGTAMAAAA CC AAfiAAAfiAACJUSAAA ' 

> ' ■ ■ * ■ » ' » ■ ■ \ ■ ■ ► ■ ■ ■ ■ ■ 

G X PVQ GKXT X KEQXTAflPLP 

- AtGGCAATCACTTAA 3 9 

« 1 — » ■ ■ ■ 493 

M A X T • 



480 



WO 9*11952 



FCT/US95O3099 
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Figure 10 
KGF 

5 ' A WTCS rCrCC T G AACCTC*TJU5»5TTC^^ 



1 » H ► ► ■ > «o 

M S 3 V B R H T ft - 3 T O X M 8.0 G O t ft 

^JlC Gro CTCT CT TC TGC CCTACCaU^ 

■» 1 ■» > ■» ► uo 

VRRLfCftTQlfrLRIOKRO KV 



- AAGGGCACCCAAGAG AXGAAAAACAACTACAATAX7ATGGAAATCCGT A 
— ■ > ■ ■ ■■ I ■ ■ ► ■■ ■ ■ 

KG7Q8MXHH XHZM8 Z R T V A V 

KCTATCGTTGCAATCAAAGGTGTTGAAgCTGAATTCT^^ 



180 




240 



300 



KLTAKKECNEDCMTKC 
- AACCACT ACAACACCT ACGCATCTGCT A 



X Z* E 



STTCGTT- 

360 



HHYNTTA3 AKWTHMGG SMFV 



rAAAAAAACCAAAAAAGAACAGAAAACC- 
— ^ » I 420 



ALMQXGZPVRG X B_T X X £ Q.X T 



-GCTCACTTCCTGCCGAXGGCAATCACTTAA 3' 

» ■ ■ I 450 

AHFLPHAIT • 



WO 9*11952 ,4/15 PCI7US9S/13099 

Figure 11. — — 

KGF AN23 

5 • XS Q rCC yACGACTACATGGAAGGTGGTGACATCCGCSTAC G T CG T C tSTTC X UCCU T A CC* 

I > 1 I : ► 60 

MS.XDYMEGGD I R V R R L F C R T 

H^GTGGTACCTGCGTAXCGACAAA^ 

1 1 r» ► uo 

Q W X L R X 0 X R G X V X G T Q E M X N 
-AACTXCAATATTATGGAAATCCGTJ 



rATCGTTGCAXrCAAAGCTGTr- 

1 t ■■ ■■■■ ■ — > ■> 1 ' ■ — » X80 

M X N X M £ X R T V A V G I V A X X G V 

-GAATCTG3Uk!TTCTACCTGCKSAATGAAC A AAgAAGCT 

I ■»■' ■■ » ■ » ■ 240 

E S t T X L A M M X SG X L T A X X S C 

-AACGAAGACTGCAACTT C AAAGAACTGAT CCT 



300 




-GCX 



A X W TH U G G E M F V A I» MQ XG I P 



V RG X X T XXE Q XT A H F L P MA X 



>+ 420 



-ACTTAA 3' 
426 



WO 96/11352 



: - Asm 

Figure 12 
KGF AN23/R(144)Q 



PCT/US9S/13099 



5 • ATOTCCTACGACTACATGGAAflG ' TGGTGACAgCCGCgTAC G T CG tC TGWC IGC C GIA CC * 
M 3TDXMS GG0IR VRRX.PC R7 6 ° 




120 



H T M X H X I X T V A V G I V A I X 6 V 
■^flATCTGAAJTCTATCTTGCAATGAACA iffi* 1 1 irrgfiTfy AAAfi AAAGAATGC 
X S S *r T Z» A M V X t G X X, TAX X E C- 



+ 180 



240, 



-AATgAACATTGTAACTTCAAAgAACTAATYCTOTAAAACCACT 
1 » »' » — m ► 300 



3(0 




A X If T H MG G E M f V A L Bf Q X G X 9 



"GTTCAAGGT AA gAAA A CC A AG AAAfi A ACAGA A AAC CGCT CA Ci JCL IIjUJ GAIGGCAATC- 

> ■» I i 420 

VQGXX7XKSQXTA:HFL?MAZ 
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